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Abstract 

We consider a meanfield system of interacting Fisher- Wright diffusions with selection and 
rare mutation on the geographic space {1, 2, • ■ • , A'^}. The type 1 has fitness 0, type 2 has 
fitness 1 and (rare) mutation occurs from type 1 to 2 at rate m ■ A'^"^, selection is at rate 
s > 0. The system starts in the state concentrated on type 1, the state of low fitness. We 
investigate this system for TV — >■ oo on the original and large time scales. 

We show that for some a G (0, s) at times log N -\- t^t ^'^,N ^ oo the emergence of 
type 2 (positive global type-2 intensity) at a global level occurs, while at times log N + tN , 
with tN ^ oo we get fixation on type 2 and on the other hand with In ^ —oo as N ^ oo 
asymptotically only type 1 is present. 

We describe the transition from emergence to fixation in the time scale a~ ^ log N + t,t G R 
in the limit A'^ — >■ oo by a McKean-Vlasov random entrance law. This entrance law behaves 
for t — > — oo like 'We""'*' for a positive random variable *W. The formation of small droplets 
of type-2 dominated sites in times o(log A'^), or 7 • log N,j£ (0, a^^) is described in the limit 
A'^ — > 00 by a measure- valued process following a stochastic equation driven by Poissonian type 
noise which we identify explicitly. The total mass of this limiting (TV — >■ 00) droplet process 
grows like VV*e"* as t — ^ 00. We prove that exit behaviour from the small time scale equals 
the entrance behaviour in the large time scale, namely £[*VV] — C[W*]. 

Keywords: Interacting Fisher- Wright diffusions, mutation, selection, rare mutation, punctuated 
equilibrium, random McKean-Vlasov equation, random entrance law, droplet formation, atomic 
measure-valued processes. 
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1 The model and the results 
1.1 Background and motivation 

Mutation and selection both play essential roles in the evolution of a population. Mutation increases 
genetic diversity, while selection reduces this diversity pushing it towards states concentrated on the 
fittest types. The balance between these two forces is influenced heavily by the effect of migration. 
Randomness enters through resampling (pure genetic drift). 

To give a precise setting for this study we model the population dynamics according to a 
stochastic process arising as the diffusion limit of many individuals incorporating the basic mecha- 
nisms for a spatial population traditionally employed in genetics and evolutionary biology. In this 
model random fluctuations are included in contrast to other infinite population limits which are 
ruled by deterministic differential equations (for example, see |Bu| ) . More precisely the model we 
use arises from the particle model driven by migration of particles between colonies, and in each 
site by resampling of types, mutation and selection. Increase the number of particles as and 
give them mass e. As e — >■ 0, a diffusion limit of interacting multitype diffusions results if the 
resampling rate is proportional to the number of pairs and there is weak selection, i.e. selection 
occurs at a rate decreasing with the inverse of the number of particles per site. Otherwise with 
strong selection, i.e. selection at a fixed rate, we get the deterministic limit, often referred to as 
infinite population limit. 

These stochastic models which describe populations of possibly infinitely many types are called 
interacting Fleming- Viot processes and have been intensively studied in recent years (for example, 
see [EG], [D|, [EKl] - [EK3] . |DGV| . jDGQQj ). The basic mechanisms are migration between 
colonies in the diffusion limit this takes the form of deterministic mass flow between sites and then 
within each colony resampling (i.e. pure genetic drift) which gives the diffusion term, selection of 
haploid type based on a fitness function on the space of types and mutation between types occuring 
at rates given by a transition kernel on the space of types, the latter two in the diffusion limit are 
drift terms giving a deterministic massflow between types. 

The purpose of this paper is to investigate how new types brought in by rare mutation invade 
and conquer the whole population through the composed effect of resampling, mutation, selection 
and migration. 

The first part of the picture we focus on is a scenario (punctuated equilibrium) which has received 
considerable attention in the biological literature (see for example jEGl| . jEG2j and the references 
below). In this scenario a population remains for long periods of time in a nearly stationary state 
(quasi- equilibrium or .stasis) until the emergence of one or several rare mutants of higher fitness 
that then take over the population relatively quickly. The explanation of this phenomenon has two 
aspects tunnelling between local fitness maxima and the effect of the spatial structure. 

For example one explanation of this tunnelling was given by Newman, Cohen and Kipnis |NCK] 
in the context of Wright's theory of evolution in an adaptive landscape (see [Wrl) . [Wr2| .|W j ). In 
this explanation the long times between transitions from one quasi-equilibrium to another cor- 
responds to "tunnelling" between one adaptive peak and another in a situation in which local 
stabilizing selection maintains a population near a local peak. In this setting, the Ventsel-Freidlin 
theory of small random perturbations of dynamical systems asserts that the actual transition, 
when it occurs, takes place rapidly. Thus this scenario would be consistent with the observations 
of paleontologists of periods of rapid evolution separated by very long periods of "stasis" . This is 
a stochastic effect not present in the deterministic version, that is, the infinite population limit, 
but must be formulated in terms of a stochastic diffusion limit, which we consider here. 

A second part of the picture is the role of geographic space and its strong effects on the 
qualitative behaviour. Namely typically a large population is subdivided into small subpopulations 
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(colonies) occupying different geographic regions. Then tunnehing between adaptive peaks can 
occur in these subpopulations and then spatial migration of individuals from a colony corresponding 
to a higher adaptive peak can spread and result in the take-over of the entire population which 
then fixates on the fitter type. 

Our goal is to develop a rigorous framework to discuss a model incorporating these features 
of a punctuated equilibrium (by rare mutation) but with a different driving mechanism for the 
tunnelling starting from rare mutants than in the literature quoted above. In this context our 
goal is also more generally to give a framework which provides insight into the respective roles 
of mutation, selection and spatial migration in the emergence and spatial spread of rare mutants 
corresponding to a higher adaptive peak (quasi-equilibrium) . 

In this paper we develop the scenario in the two-type case, one type of low and one type of high 
fitness. The full picture meaning (1) an infinite hierarchy of fitness levels, (2) with AI > 2 (instead 
of il/ = 1 here) types on the successive levels and (3) all this embedded on a countable geographic 
space instead of {1, 2, • • • , N}, which approximates is developed in |DGsel| . 

Our goal in this paper is to exhibit how a rare mutant which is fitter than the types in the 
current population invades the population, by starting to take over a sparse set of colonies by 
selective advantage and then spreads as an increasing droplet in space till the complete population 
is finally taken over. The basic technique to do so is to consider the limit — > oo. 

Indeed using this technique of taking N ^ oo we can produce limiting objects, which have 
a simple and fairly explicit description. However due to the interaction between the migration, 
selection, mutation and resampling mechanisms, we have to develop some new ideas to adapt the 
method of multi-scale analysis to resolve some delicate conceptional and mathematical problems 
that arise in this case. (In previous work |DG99j the qualitative analysis of the longtime behaviour 
did not include mutation.) 

A further point is that we give here a rigorously treated example for two-scale phenomena and 
develop as mathematical tool the notion of random nonlinear evolutions, more specifically solutions 
with random McKean-Vlasov entrance laws from time — oo. The technique of random entrance 
laws allows to connect the separating time scales. This phenomenon is of relevance in many other 
applications. 

In order to describe the early occurrence of rare mutants we work with a description of the 
sparse set of colonies conquered by the rare mutants via atomic measure-valued processes which 
in the limit N oo can be described by stochastic processes driven by (inhomogeneous) Poisson 
random measures and in order to calculate the intensity measure we are using excursion theory. 

We have developed in |DGsel| a technique to extend this picture to infinite geographic space, 
multiple types on each level of fitness and infinite hierarchy of fitness levels. There we get a 
good set-up to study the scenario of punctuated equilibria. However this analysis still suffers 
some shortcomings. The restriction of our method is that we assume that the different time and 
space scales separate in the limit of large times and large space scales which is of course only an 
approximation but allows us to make precise the notion of a quasi-equilibrium as equilibrium of 
a limiting non-linear McKean-Vlasov dynamic. Moreover the techniques can be extended to the 
multitype case and then allow to define the concept of a quasi-equilibrium in a mathematically 
rigorous fashion as equilibria of certain limiting dynamics in the various time scales. 

1.2 The model and key functionals 

We begin here below with the description of the problem for the precisely stated model, define key 
functionals and formulate the problem. 
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1.2.1 The model 

We define a system of A^-exchangeably interacting finite populations {d > 0) with two types {1,2} 
with fitness and 1 respectively, selection rate s > and migration rate c > 0. 

This means that we study the stochastic process (X^(t))t>o, which is uniquely defined as 
follows: 

(1.1) X^{t) = ((xf (z, t), x^{i, t))- z - 1, . . . , TV), 

(1.2) xf(z,0) = l, z = l,---,iV, 

satisfying the wcUknown SSDE where i G {1, 2, • • • , N}: 

(1.3) dxf(i,t) =c(xf(t)-xf(i,t))di - sx^{i,t)x^ii,t)dt~'^x^{i,t)dt 



\J d ■ Xi [i, t)x2 [i, t)dwi{i, t), 



(1.4) dx^{i,t)=c{x^{t)-x^{i,t))dt + sx^ii,t)x^ii,t)dt+^x^{i,t)dt 



\J d ■ Xi{i, t)x2 (i, t)dw2{i, t), 



where {{wi{i, t))t>o, i — 1, ■ ■ ■ , N} are i.i.d. Brownian motions and z«2, wi are coupled via W2{i, t) = 
-wi{i,t), 

N 

(1.5) Sf(i) = ^a;f(i,i) with f = 1,2 

i=l 

and 

(1.6) xi{t) = -^Xi{t) with ^=1,2. 

Without loss of generality we can assume for the resampling rate d that d = 1 and will do this 
except where we wish to indicate the roles of the parameters c, s, d, to. The simple collection of N 
one-dimensional stochastic differential equations in (|l.ip - (|1.4p allows us to focus first on the key 
features of emergence, which are already complicated enough as we shall see. 
Step 2 (Key Junctionals and main objectives) 

The description of the system proceeds via four objects, namely either locally by a sample of 
tagged sites or globally by the empirical measure over the whole spatial collection and furthermore 
we use as well an atomic measure to globally describe sparse (in space) spots of type two mass 
respectively the Palm measure to describe this locally by formalizing the concept of a typical 
type-2 site. The asymptotic analysis of these four objects leads to propagation of chaos results, 
respectively, nonlinear McKean-Vlasov limiting dynamics and atomic measure-valued processes 
driven by Poisson processes and formulas for Palm measures based on this. 

Since it often suffices to consider the tagged site 1, i.e. x^{l,t), in the sequel to designate a 
tagged site we set 

(1.7) xf (i) =xf (l,t) ;£=1,2. 

If we are interested in the local picture we consider for some L G N the tagged sites 

(1.8) (xf(l,t),---,xf(L,i)), ^=1,2. 
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We abbreviate the marginal law of the type-2 mass as 

(1.9) f^^' ■.= C[x^ii,t)]eri[0,i]). 

The empirical measure process gives a global picture and is defined by: 
1 ^ 

(1.10) E^it) ^ E '^(.f e ^(^({1' 2})) 

1=1 

and the empirical measure process of cither type is defined by 
1 ^ 

(1.11) S^(t,^):=-^<5,«(,,,)e7'([0,l]), ^ = 1,2. 

1=1 

Note that since x^{i,t) + cc^ {i,t) = 1 it suffices in the case of two types to know one component 
of the pair in (|l.lip . the other is then determined by this condition. Then we can for two types 
effectively replace V{V{{1, 2})) by ^([0, 1]) as states of the empirical measure. 

Next we have to describe the sparse sites where type 2 appears. First consider N sites starting 
only with type 1 mass at time and evolving in the time interval [0,To]. As a result of the rare 
mutation from type 1 to type 2 at rate ^ mutant mass will appear and then form non-negligible 
colonies at a sparse set of the total of N sites, but the typical site will have mass of order o(l) as 
— > oo. We can now again take a global perspective, where we describe the sparse set of colonised 
(by type 2) sites or a local perspective, where we consider a typical colonised site via the concept 
of the Palm measure. We introduce both viewpoints successively. 

Turn to the global perspective. In order to keep track of the sparse set of sites at which 
nontrivial mass appears we will give a random label to each site and define the following atomic- 
measure-valued process. 

We assign independent of the process a point a{j) randomly in [0, 1] to each site j € {1, . . . , N}, 
that is, we define the collection 

(1.12) {a(i), .7 = I7 • • • : are i.i.d. uniform on [0, 1]. 

We then associate with our process and a realization of the random labels a measure- valued process 
on ^([0, 1]), which we denote by 

N 

(1.13) 2^'^^Y.'^2iM)SaU)- 

J = l 

This description by 2^'™ is complemented by a local perspective, arising by zooming in on 
the random location where we actually find mass by studying a typical type-2 dominated site. In 
order to make precise the notion of a site as seen by a typical type-2 mass, i.e. a site seen from 
a randomly (among the total population in all TV sites) chosen individual of type 2, we use the 
concept of the Palm- distribution, which we indicate by a hat. Define 

(1.14) /2f = C[x^{l,t)], by Ji^iA) =(j x^f (rfx)j /E[x^{l,t)], 

where = C[x2 (1,^)]- The law Jl^ describes now for an i.i.d. initial state the law of a site, 
typical for the type-2 population at time t in the limit TV — ^ 00. Namely we can think of this as 
picking a type-2 individual at random and then determine the site where it sits. Note that 
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The scenario 

The basic objective of tliis paper is to describe the emergence of the more fit type, i.e. type 2, 
using the above functionals of the process. This emergence occurs in three regimes, two are of short 
duration separated by a third of long duration and arise as follows in an asymptotic description as 

iV ^ GO. 

(1) In the first regime during times of order 0(1) rare mutants arise at a small set of sites 
(droplet) where they locally make up substantial part of the population and then develop mutant 
"droplets" by migration. We will see later in Subsection [331 in all detail that the mutant population 
during this stage develops as a type of measure-valued branching process. We refer to this stage 
as droplet formation. 

(2) The next stage, namely, the stage at which type 2 has very small but 0(1) frequency at a 
typical (i.e. randomly chosen site) site, or equivalently at the macroscopic level has 0(1) density, 
occurs at time O(logA^), we refer to this as emergence. 

(3) The last regime at which the macroscopic density of type 2 reaches a frequency arbitrarily 
close to 1 is taking a further piece of time but is again only of order 0(1) and is referred to 
as fixation. For the regime between emergence and fixation we give a description by a limiting 
evolution in the form of an entrance law evolving from time — oo and type-1 states to type-2 states 
at time +00 and hence altogether with time index < S R. 

In the sequel we will analyse these three regimes of droplet formation, emergence and fixation 
by studying the behaviour as — > 00 of 



in finite times as well as in larger A'^-dependent time scales thus capturing each of the three regimes 
described above. 

1.3 Statement of results 

We have three tasks, first to determine the macroscopic emergence time scale, that is the time scale 
at which the level two type appears at a typical site or equivalently has positive intensity, second, 
to describe the pre-emergence picture, in particular the droplet formation respectively the limiting 
droplet dynamics and thirdly to describe the limiting dynamic of fixation after the emergence with 
which the take-over by type 2 then occurs. 

We begin the analysis with introducing the two different limit dynamics in the subsequent two 
subsubseetions and then subsequently the emergence, fixation and convergence results are stated 
in a further subsubsection. 

1.3.1 The Hmiting random McKean-Vlasov dynamics and random entrance laws 

We start with the initial condition SAr(0, {1}) = Si (recall (jl.lip ). that is only type 1 appears. 
The objective is to establish the emergence and fixation of type 2 in times of the form t^ = 
C ■ log N + t, to identify the constant C, and to identify the limit of ^^{tN, •)) as a function of t in 
the process of emergence and determine the dynamics in t of the fixation (takeover) which leads to 
the concentration of the complete population in type-2 as — > 00. In the next subsubsection we 
shall then discuss the behaviour at early times, i.e. times ijv with limsup((logtjv)/log A^) < C, in 
the stage of droplet formation. 

The key ingredients in the limiting processes in times tN for the local and global description 
are: 

• the limiting (nonlinear) McKean-Vlasov dynamics, 



(1.16) 



E^it), {x^ii, i); £ 1, 2; i 1, • • • , i} and I 



■N.m 




• the entrance law from —00 of the McKean-Vlasov dynamic and 
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• random solutions to the McKcan-Vlasov equation. 

In order to define these ingredients we proceed in three steps, we recall first in Step 1 the "classical" 
McKean-Vlasov limit (associated with a nonlinear Markov process) before in Step 2 and Step 3 we 
introduce the two new objects. 

Step 1 Consider the above system (|l.ip - (|1.4p of N interacting sites with type space K ~ {1, 2} 
starting at time t = from a product measure (that is, i.i.d. initial values at the N sites). The 
basic McKean-Vlasov limit (cf. |DG99| . Theorem 9) says that if we start initially in an i.i.d. 
distribution, then 



(1.17) {EN{t)}o<t<T =^ {Cth 



<t<T, 

where the 7^(7'(IK))-valued path {Ct}o<t<T is the law of a nonlinear Markov process, namely the 
unique weak solution of the McKean- Vlasov equation: 

(1.18) ^ = iLfrj^t, 

where for tt G 7'('P(K)), and F{fi) = /(/i(2)), 

(1.19) L^F^c[Jy7r{2^dy)-x]^ + sx{l-x)^ + ^xil~x)^ 

and the * indicates the adjoint of an operator mapping from a dense subspace of Cf,(£', M) into 
Ci,{E, R) w.r.t. the pairing of V{E) and Ci,{E, R) given by the integral of the function with respect 
to the measure. Similar equations have been studied extensively in the literature (e.g. [Gar]). The 
process {Ct)t>o corresponds to a nonlinear Markov process since Ct appears also in the expression 
for the generator L. 

As pointed out above in (|l.lip . in the special case IK = {1,2}, we can simplify by considering 
the frequency of type 2 only and by reformulating ()1.18p living on 'P{V{{1,2,})) in terms of 
jC,t{2) G V[0, 1]. This we carry out now. 

Namely we note that given the mean-curve 



(1.20) m(t)= J yCt{2){dy), 

[0,1] 

the process {£t{2))t>o is the law of the strong solution of (i.e. the unique weak solution) the SDE: 



(1.21) dyit) = c{m{t) - y{t))dt + sy{t){l - y{t))dt + ^ dy{t){l ~ y{t)dw{t). 

Then informally {Ct)t>o corresponds to the solution of the nonlinear diffusion equation. Namely 
for t > 0, £t(2)(-) is absolutely continuous and for 

(1.22) Ct{2){dx) ^u{t,x)dx e7'([0,l]) 

the evolution equation of the density u{t, ■) is given by: 

d d f d d d"^ 

(1-23) ^■"(*'^) = / yu{t,y)dy-x]u{t,x)}~s—(x{l-x)u{t,x))+- — {x{l-x)u{t,x)). 

[04] 

Step 2 In order to describe the emergence and invasion process via S^v, we introduce in this 
step and the next two extensions of the nonlinear McKean-Vlasov dynamics which describes only 
the limiting evolution over finite time stretches, t € [io, to + T] given an initial condition Ct„ which 
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wc then follow as to — > — oo. Hence we have to define the dynamics for t e (—00,00) in terms of 
an entrance law at —00. 

Since we consider the limits of systems observed in the interval const ■ \ogN + [— ^] with T 
any positive number, that is setting to{N) = const ■ \ogN — ^, we need to identify entrance laws 
for the process from —00 (by considering T — > 00) out of the state concentrated on type 1 with 
certain properties. 

Definition 1.1 (Entrance law from t = —00) 

We say in the two-type case that a probability measure-valued function £ : R — > V{[0, 1]), is an 
entrance law at —00 starting from type 1 i/ (£t)te(-oo,oo) such that Ct solves the McKean-Vlasov 
equation il.ll^) and Ct — > 5i as t — > —00. 

In the case of more than two types we work with maps £ : M — > V{V{JS.)) where we require as 
t — > —00 that Ct converges to measures 5^ with n e 7'(K) such that ^ is a measure concentrated 
on the lower level types. □ 

The existence of such an object is obtained in part (c) of the proposition below. 

Step 3 The usual formulation of the McKean-Vlasov limit requires that we start in an i.i.d. 
initial configuration. This is not sufficient for us since even though we consider systems observed 
in finite time stretches we do so only after large times, namely, after the time const ■ log N with 
N 00. Hence in our context the McKean-Vlasov limit is valid in a fixed time scale but if the 
system is viewed in time scales that depend on N this can (and will) break down since we only 
know that the initial state is exchangeable and this then leads to a random solution. 

We will indeed establish that the emergence of rare mutants gives rise to "random" solutions 
of the McKean-Vlasov dynamics. In particular we will show that the limiting empirical measures 
at times of the form ClogiV -|- t are random probability measures on [0, 1] and therefore given by 
exchangeable sequences of [0, 1]- valued truely exchangeable random variables which are not i.i.d., 
that is, the exchangeable cr-algebra is not trivial. This means that the empirical mean turns out 
to be a random variable and since this is the term driving via the migration the local evolution 
of a site in the McKean-Vlasov limit, the non-linearity of the evolution equation comes seriously 
into play. However once we condition on the exchangeable cr-algebra, we then get for the further 
evolution again a deterministic limiting equation for the empirical measures, namely the McKean- 
Vlasov equation. The reason for this is the fact that conditioned on the exchangeable cr-algebra 
we obtain on asymptotically (as N — )■ 00) i.i.d. configuration to which the classical convergence 
theorem applies. Using the Feller property of the system, which is a direct consequence of the 
duality, we get our claim. 

This leads to the task of identifying an entrance law in terms of a random initial condition 
at time —00. A consequence of this scenario is that when we use the duality from time Tjv to 
Ttv + 1, we apply it to a random initial state in the limit and we therefore have to use because of 
the non-linearity of the evolution the appropriate formulas. 

The above discussion shows that we need to introduce the notion of a truly random McKean- 
Vlasov entrance law from —00. 

Definition 1.2 (Random solution of McKean-Vlasov) 

We say that the probability measure-valued process {£(t)}tgR is a random solution of the 
McKean-Vlasov equation U.18\) if 

• {Ct ■ t G M} is a.s. a solution to U.18]) . that is, for every to the distribution of {Ct '■ t > to} 
conditioned on J-t^ ~ o{Cs '■ s < to} is given by S{fit}t>to ^^^'''^ ^ solution of the 
McKean-Vlasov equation with fito ~ Ct„, 

• the time t marginal distributions of {Ct '■ t G K} are truly random. □ 
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The key result of this subsubscction on the objects introduced in the previous three steps is 
now the foUowing existence and uniqueness results on the solution of the McKean-Vlasov equation 
(|1.18p and of its (random) entrance laws from t = —oo: 

Proposition 1.3 (McKean-Vlasov entrance law from — ooj 

(a) Given the initial state fiQ G ^([0, 1]) there exists a unique solution 

(1.24) Ct{2){dx) ^ fitidx), t>to 

to iL18\) with initial condition Ctg(2) = /xq. 

(b) If s > and J^^ x^to(dx) > 0, then this solution satisfies: 

(1.25) \im Ct{2){dx) = di{dx). 

>CJO 

(c) There exists a solution (£j*(2))tgR to equation U.18]) satisfying the conditions: 

(1.26) lim C;*{2) = So, 

lim C*t*{2) = (5i 

t^OO 

f xC*o*i2,dx) = 1. 

"'[0,1] ^ 

This solution is called an entrance law from —oo with mean ^ at t ~ 0. 

(d) We can obtain a solution in (c) such that: 

(1.27) 3 a e (0,s) and Aq e (0, oo) such that lim e"l*l / xC**{2,dx) = Aq. 

J[o,i] 

(e) The solution of \L18]) also satisfying ^1-27^ for prescribed Aq is unique and if Aq G (0, oo) 
then a is necessarily uniquely determined. 

For any deterministic solution 

(1.28) A, i e M 
to HTTS]) with 

(1.29) < limsupe"!*! /" xCt{2,dx) < oo, 

t-i.-oo "'[0,1] 

the limit A = limt^_oo e"'*' /jg ^ xCt{2, dx) exists. 

If A> Q, then {Ct,t e M} is given by a time shift of the then unique {C^* ,t G R} singled out 
in [TF?^ , i.e. 

(1.30) Ct^Cr+r, T = a-i log A. 

^0 

For future reference we define {CDtem to be the unique solution satisfying 

(1.31) lim e"l*l / xCl{2,dx) = 1 for some a £ (0, s). 

(f) Any random solution {Ct)t£M to U.18\) such that 

(1.32) limsupe"l*l£;[/ xCt{2,dx)] < oo, liminf e"l*l[ / xCt{2,dx)] > a.s., 

is a random time shift o/(£(*)(gR (and of {Cf)t£m.). □ 
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Example 1 Let be a solution satisfying il.23\) . ^1.27^ and for a given value of A let t be a true 
real-valued random variable. Then is a truly random solution. This can also be viewed 

as saying that we have a solution with an exponential growth factor A which is truly random. 

Remark 1 We shall derive later on a bound from above on the cardinality of our dual process which 
will imply that the first relation in il.32\) must be satisfied for a limiting dynamic (as N ^ oo) of 
our empirical measures Sjv and furthermore we get a lower bound implying that A is a.s. positive 
for this limiting dynamic, (see {3.146^ and Subsubsection W.2.6\) . Then we are able to use the 
identification of the solution given in ll.30\} or of the one satisfying il.31]) to identify the limiting 
dynamic of the process of empirical measures. 



1.3.2 Limiting dynamics of sparse sites with substantial type-2 mass: Droplets 

We next describe the limiting dynamic of tlie sparse set of sites wliicli Irave been colonized by type 
2 prior to the onset of emergence. There are two time regimes, first an initial finite time horizon 
of order 0(1), then large times i^r — > oo as iV — > oo but which remain << logiV so that we 
are in the precmcrgcnce regime with still a global density of type 2 which is asymptotically zero. 

It is very unlikely that a randomly chosen site has at time 0(1) mass of type 2 at least e > 
as iV — )• oo. In fact an explicit calculation (sec p.703p ) shows that the transition density decays 
like constN~^ . On the other hand the number of sites increases with so that here we have a 
compensation provided wc do not look at only one single tagged site but the complete population 
at all sites and the result is that considering all sites there is a finite random number of sites with 
substantial type 2 mass. Newly added sites in this set of sites arise from a process entering from 
the state 0. In fact we obtain here a Poisson distribution in the limit. This whole scenario will be 
made precise using entrance laws from state which we derive using diffusion theory and excursion 
law theory. 

Therefore before formulating the main results of this section we first consider entrance laws of 
a single site with no mutation from type 1 to type 2 which provides us with a key ingredient, the 
excursion measure Q. Wc state the wellknown fact: 

Lemma 1.4 (Single site: entrance and excursion laws) 

(a) Let c > 0, c? > 0, s > 0. Then is an exit boundary for the the Fisher-Wright diffusion 

(1.33) dx{t) = ~cx{t)dt + sx{t){\ - x{t))dt + d ■ x{t){l - x{t))dw{t), 

which then has a a-finite entrance law from state at time 0, the a-finite excursion law 

(1.34) Q = Q'^-'^''' 
on 

(1.35) Wo := {w e C([0,oo),M+), w{Q) = 0, w{t) > for < t < ( for some C e (0,oo)}. 

(b) Moreover, denoting by the law of the process started with w{0) — e and e > 0,Q is given 

by: 

(1.36) Q(.)-lim-^, 

e->o S{e) 

where S{-) is the scale function of the diffusion \LS3\) . defined by the relation, 

(1.37) Pe(T^ <oo) = ||^, 0<£<7?<oo, 



1 THE MODEL AND THE RESULTS 



12 



where is the first hitting time of rj. 

For the Fisher- Wright diffusion S is given by (cf. IRW^ . V28)) the initial value problem: 

(1.38) 5(0) = 0, 



dx (l-x)2'=' 
so that 

(1.39) lim ^ = 1. 

e->0 e 

(c) The measure Q is a-finite, namely for any r/ > 0, C as in iL35\) . 

(1.40) Q{{w : ((w) > 7/}) < oo, 

(1.41) Q(sup(w(t)) > ?]) = Q{T^ < oo) = — ^ oo as ry ^ 0, 



an. 



d 



(1.42) / xQ(sup(«;(i)) € dx) = oo. □ 
Jo t 



Proof It is wellknown that is an exit boundary for (|1.33p . In this case (a) - (c) then follows 
immediately from the results of Pitman and Yor [PYj . Section 3. 



We next identify the limit in distribution of 1^'"^ (defined in (|1.13l) ) as oo. Let 
(1.43) A4a{[0, 1]) = the set of finite atomic measures on [0, 1], 

equipped with the weak atomic topology (see |EK4| and in this present work in (|1.57p - (|1.63p below 
for details on this topology) which forms a Polish space. 

We will need Poisson random measures in four variables, the actual time called s G [0,t], the 
location parameter a € [0, 1], the mutation and immigration potential called u G [0,oo) and the 
path of an excursion called w G Wo- This allows us to obtain the droplet dynamic (!lJ")tG[o,cx3) and 
its properties: 

Proposition 1.5 (A eontinuous atomic-measure-valued Markov process) 

Let'2Q{t) — yi{t)5ai G A^a([0, 1]) where the yi(t) are independent solutions of the SDE (T 



(describing the initial droplet and its evolution) and let N{ds,da,du,dw) be a Poisson random 
measure on ( recall iL35\) for Wq ) 

(1.44) [0, oo) X [0, 1] X [0, oo) X Wo, 
with intensity measure 

(1.45) dsdaduQ{dw), 

where Q is the single site excursion law defined in lll.36\) in Lemma \1.4\ 
Then the following three properties hold. 

(a) The stochastic integral equation for (II™)t>o a process with values in A^a([0, 1]) is given as 

(1.46) 2T^Toit)+ / / / w{t~ s)5aN{ds,da,du,dw), t>0, 



J[o,i] Jo Jw. 
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where (5a(-) G A^i([0, 1]) and where q{s,a) denotes the non-negative predictable function 

(1.47) g(s,a):=(m + d™([0,l])), 

has a unique continuous A4ai[0,l]) -valued solution, which we call 

(1.48) OT)t>o- 

(b) (!l™)t>o "is a 7Wa([0, l])-valued strong Markov process. 

(c) The process {2]^)t>o has the following properties: 

• the mass of each atom observed from the time of its creation follows an excursion from zero 
generated from the excursion law Q (see U.36]) ). 

• new excursions are produced at time t at rate 
(1.49) m + dr([0,l]), 

• each new excursion produces an atom located at a point a G [0, 1] chosen according to the 
uniform distribution on [0,1], 

• at each t for e > there are at most finitely many atoms of size > e, 

• i — >■ J"'([0, 1]) is a.s. continuous. □ 

Remark 2 The process (Il™)f>o can be viewed as a continuous state analogue of the Crump-Mode- 
Jagers branching process with immigration (see Subsubsection \J.2.4\ Step 2 for a review and more 
information on this type of processes). We shall see in jj. 7ff[ ) that the total mass grows exponentially 
as in a supercritical branching process. □ 

Remark 3 A necessary and sufficient condition for extinction of the analogue of 2* for a general 
class of one- dimensional diffusions was obtained by M. Hutzenthaler f JlHu^ ). In the Fisher- Wright 
case with c > 0, s > the fact that the probability of non- extinction is non-zero follows from the 
Proposition □ 

Proof (a) and (b) The existence and uniqueness of the solution to (|1.46p and the strong Markov 
property foUows as in |D-Lij and |FLj . 

(c) foUows directly from the construction via the Poisson measure N given in (|1.46p . 

Remark 4 We can enrich the process (J™)t>o to include the genealogical information, namely 
which mass results from which mutation. For that purpose we have in addition to the location 
record the birth times of atoms due to migration (successful colonization). This means we have to 
split the rate at which excursions are created into mutation at the site and immigration from other 
sites. At time we start with 

(1.50) Toit) 

describing the initial atoms we consider to be present at time 0. 
The genealogical enrichment denoted 

(1-51) (5r)t>o 

is a measure-valued process on a richer set E and is obtained as follows. 
Let 

(1.52) £; = ((R+ X [0, 1]) U 0)^ 
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and for y (z E define T{y) ~ min{n : y{n) ~ 0} — 1. 

Let N{ds, da, du, dw) be a Poisson random measure on [0, oo) x [0, 1] x [0, oo) x with intensity 
measure dsdaduQ{dw) where Q is the single site excursion law. Then the following stochastic 
integral equation has a unique continuous solution, H™; 

(1.53) 5r=5SW+/ / / V wit-s)Sy.^^,,a)Nids,da,du,dw), 

Ja J E Jo Jwo 

where (s, a) is short for ((s, a), 0, 0, • • 

(1.54) q{s, y,a)=m if T{y) = 0, q{s, y, a) := c5™ (y), T{y) ^ 0, 
and 

,^ ^j-N o : E X E ^ E, 2/1 o 2/2 = (z/i, 2/2:0) (concatenation) with , 
^ ' y' = (2/'(l),---,2/'(r(2/'))) ,0 = (0,0,---)- 

Here measure-valued process on E and y o (s, a) denotes the offspring of y with birth 

time s and location a € [0, 1]. Moreover, Il™({2/ : T{y) ~ oo}) = a.s.. 

Some topological facts concerning atomic measures. We finally recall the definition 
and some facts on the weak atomic topology with metric pa on the space of finite atomic measures 
A^a([0, 1]) due to Ethier and Kurtz. Recall that we have the topology on 7VJ([0, 1]) induced by 
the weak topology which is induced by the Prohorov metric p. Next choose a function , where 
: [0, oo) -J> [0, 1] is continuous, nonincreasing and \['(0) = 1, \1/(1) = 0. Then for ly, p e Ma{[0, 1]) 
one defines 

(1.56) pa{v,p) ■■= p{v,p) 



sup 

0<£<1 



*(^^ A l)/i(dx)/i(d2/) - / / -^{^-^^^ M)y{dx)y{dy) 
[04] "'[04] ^ J [04] J [04] £ 



We refer to pa as the Ethier-Kurtz metric. 

The space (A^a([0, 1]), Pa) is a Polish space and the topology, in other words convergence, does 
not depend on the choice of ^' (the geometry of the space of course does). 

The following lemma collects what we need on the relation between the weak and the weak 
atomic topologies, which arc different. 

Lemma 1.6 (Weak atomic topology and weak topology) 

(a) A sequence of random finite atomic measures on [0,1], (/irOneN, converges to p in the weak 
atomic topology if and only if 

(1.57) pn converges weakly to p 
and 

(1.58) pl{[Q,l])-^y{[Q,l]), where m*([0, 1]) = ^({^})'^- 

°° 3:e[04] 

(b) If the following three properties hold for /i„, /i G A^a([0, 1]) (here denotes weak convergence 
as n oo ): 

(1.59) Pn =^ the ordered atom sizes converge and the set of atom locations converges, 
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then: 

(1.60) pa{fln,f^)^0- 

(c) A continuous Mf -valued process with a.s. continuous (in the weak topology) sample paths 
of the form X] ^j(0^2;i(t) such that '^af{t) is a.s. continuous, has also sample paths a.s. in 

i 

C([0, oo), (A^/([0, l],pa)); where pa is the Ethier-Kurtz metric. 

(d) Consider a sequence {Zn,N G N} of atomic measure-valued processes with cadlag paths in the 
weak atomic topology, so that has the form 

(1.61) ZN{t) = ^ aN,i{t)6x^^^(^t) 

i 

for suitable functions (oft) aN.i,XN,i- Assume furthermore that 

(1.62) {£(Zjv),iV e N} m'P(D([0,oo),(7W/([0,l],/9))) is relatively compact. 

Then the compact containment condition will hold also in (A^a([0, 1]), Pa), */ o-nd only if for 
each T > and 6 > there exists e > such that 

(1.63) infP 

N 

Here 4* is as in il.56]) . 

Proof This follows from work by Ethier and Kurtz |EK4| . namely (a) - Lemma 2.2, (b) -Lemma 
2.5, (c) - Lemma 2.11, (d) - Remark 2.13. 

1.3.3 Emergence time, droplet formation, fixation dynamic for A^-interacting sites: 
Statement of results 

We now have the ingredients and the background to continue the analysis and state all the results 
of the finite population model with two types IK = {1,2} with fitness and 1, respectively, s > 
and d > Q and with TV exchangeable sites and with c > 0. We start with only type 1 present 
initially. The exposition has three parts, the emergence, the preemergence (droplet formation) 
and the fixation. 

Part 1: Emergence 

The goal is to describe in mathematical precise form the initial formation of germs for the 
expression of the fitter type 2 by mutation which subsequently expand which then leads finally 
to the global emergence of the fitter type, and over the period of order 0(1) the increase of the 
mass of this type continues until it takes over almost the entire population (fixation). We state 
the results on this scenario in four main propositions. 

First on emergence are Proposition II . 71 which shows that (global) emergence occurs at times of 
order const dog and which identifies the constant and then Proposition ll.l2l and Proposition ll.151 
which identify the limiting dynamics in t after global emergence of type 2, meaning wc study the 
system observed in times const ■ logiV + t with t G M. The Proposition 11.91 explains the emergence 
behaviour by describing the very early formation of droplets of "typc-2 colonies" . 

In addition to the four main statements mentioned above the exact properties in the early stage 
of droplet formation are given in Proposition ll . lOl and Proposition II . 1 H Namely droplet formation 
occurs in a random manner in the very beginning followed by a deterministic expansion of the 
droplet size leading to emergence on a global level. 




A l)ZN{t,dx)ZN{t,dy) - ^a^,,(t) 



< 5 



>l-5. 
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A remarkable faet we state after the part on fixation in Proposition 1 1 . l41 which is estabhshing 
a elose eonneetion between exit behaviour in short seale and entrance behaviour in the same scale 
but placed much later such that both time intervals are separated by a long time stretch. 

We begin with the emergence times. 

Proposition 1.7 (Macroscopic emergence and fixation times) 

(a) (Emergence-time) 

There exists a constant a with: 

(1.64) < a < s, 

such that if T/v = ^ log N, then for t ^ M. and asymptotically as N oo type 2 is present at times 
T/v + 1, i.e. there exists an e > such that 

(1.65) liminf F[a;^(TAr +t) > e] > 0, 

N^oo 

and type 2 is not present earlier, namely for 1 > s > 0: 

(1.66) lim limsup[P(2:f (Tat + t) < 1 - e] = 0. 

(b) (Fixation time) 

After emergence the fixation occurs in times 0{1) as N — !> oo, i.e. for any e > 

(1.67) lim limsupP[a;f (Tjv + t)> e]^0. 

(c) The constant a can be characterized as the Malthusian parameter for a Crump-Mode- Jagers 
branching process denoted {Kt)t>a which is explicitly defined below in \8.139\) . The constant a can 
alternatively be introduced as exponential growth rate for the limiting droplet growth dynamic H™, 
see hl.SOjl or in terms of the excursion measure of a diffusion as specified below in ^3768^. □ 

Corollary 1.8 (Emergence and fixation times of spatial density) 

The relations il.65] ). \1.66\ ) and ( [j.6'7| j hold for respectively as well. □ 

Part 2: Droplet formation 

This raises the question how the global emergence of type 2 actually came about and what 
is the role of a in the forward dynamics (instead of the view back from emergence) and why do 
we have a random element in the emergence. We will now demonstrate that rare mutation and 
subsequent selection with the help of migration produce in times 0(1) a cloud of sites where the 
type-2 is already manifest with substantial mass at a given time and this cloud then starts growing 
as time increases. This growth of total type-2 mass is exponential with a random factor arising as 
usual in the very beginning (i.e. at times growing arbitrarily slow with A'^) and hence in particular 
the growth process is given by a randomly shifted exponential. We call the growing cloud of type-2 
sites a droplet (which in fact in the euclidian geographic space is literally accurate). The droplet 
will be described using the atomic measure introduced in (jl.lSp and by the Palm measure 

explained in (|1.14|) . 

For the purpose of making our three regime scenario precise we first need to investigate how for 
a finite time horizon the site of a typical type-2 mass looks like in the limit N ^ oo and if we can 
show that such a site exhibits nontrivial type-2 mass. Then we have to see how fast the number 
of such sites grows and reaches size eN at time logN -\- t{e). 

One might expect that a growth of the droplet at exponential rate a would be the appropriate 
scenario. This scenario will also show that some randomness created initially, i.e. times 0(1) as 
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N ^ oo, remains in the system up to fixation of type 2. Tliis randomness arises since up to some 
finite time T there will be among the iV-sites as N oo, 0(1) such sites where the mass of type 
2 exceeds some level 6 > 0. In the limit this will result in a compound Poisson number of germs 
for expansion. Only descendants of this early mass will make up at much later times the bulk of 
the type-2 mass, since we have exponential growth. 
Therefore we study 

• the configuration of type-2 mass on the sparse set of sites and its limiting law as TV — ^ oo for 
a finite time horizon, i.e. Hf'™ and p,^ for t € [0,T], 

• the configuration in a typical type-2 site as time and N tends slowly to infinity, i.e. , 
with ijv t oo but In = o(logiV), 

• the growth of the droplet as time goes to infinity but slower than the time needed for global 
emergence, i.e. and !1^™([0, 1]), with as above. 

To verify our scenario we want to show that /i^ converges for t^r 'f oo to a limit, say /Ij^ which 
is nontrivial, i.e. /J^((0, 1)) — 1. Then we want to prove that the number of sites looking like /I^- 
realisations grows like yV*e" for oo, with i — > oo , t at = o(log N) for some non-degenerate 
random variable W* and a number a* G (0,oo). And finally we have to identify a* as a. 

To verify that, we have to establish either that for some suitable finite and not identically zero 
random variable W* and number a* we have 

N 

(1.68) xf(t7v) :=^^^(i,ijv) W*e"**" 

1=1 

as iV — >■ oo, or alternatively that we need a spatial window of size Ne~°' to find such a type-2 
colonised site with nontrivial probability in that window. 

In order to explain the origin of the randomness in W* we need the behaviour of the law of the 
random variable 

TV 

(1.69) ^x^(z,t)=nf"([0,l]), te[0,T] 

and the localization of this total mass on different sites for iV — > oo in a suitable description, which 
is given by the full atomic measure {2^'"^)t>o- 

In particular if we have established the above scenario, we can conclude that it requires time 
(a*)~^ log -I- t for some appropriate t € K to reach intensity e of type 2 in the whole collection 
of sites and then we can conclude that a* = a. 

We now want to show three things to describe the droplet growth: (1) There is a limiting law 
for the size of the type-2 population in a typical colonized site. (2) Identify the exponential growth 
rate of the number of colonized type-2 sites. (3) Identify the limiting dynamic as A'^ — > oo of 
(3f''™)t>o exactly as the dynamic (J™)t>o- 

Addressing first point (1), (2) (for (3) see below), we shall prove the following on droplet 
formation. We use here the notational convention 

(1.70) ajv = d{bf{N)) if limsup(aAr//(iV)) < b. 

Proposition 1.9 (Microscopic emergence and evolution: droplet formation) 
a) The Palm distribution stabilizes, i.e. 

(1.71) /li^ =^ for tNt^ with tN = o(a-i log N) 



1 THE MODEL AND THE RESULTS 



18 



and the limit has the property 

(1.72) /i-((0,l)) = l. 

The law will be identified in terms of the excursion measure in h3.635\l . 

b) The total type-2 mass II(^'"([0, 1]) grows at exponential rate a* , i.e. \1.68\) holds and 

(1.73) a* = a, with a from Proposition \1.T\ 
Furthermore we have a random growth factor: 

(1.74) C fe^'(tAr)e""*"] =^ C[W*], for tN t oo with tN = d(a-^ logN). 
The random variable W* is non- degenerate. □ 

Remark 5 This result is obtained by showing that for sufficiently large ti and t2' 

(1.75) lim P (|e-"*iS^(ti) - e"*^J^(t2)| >£)<£. 

Remark 6 The mutant population making up the total population at time a^^ log A'^ + t of emer- 
gence dates back to a rare mutant ancestor which appeared at a time in [0, T'(e)] with probability at 
least 1 — £ and T{e) < oo for every e > 0. 

Remark 7 Growth in the spatial Fisher- Wright and the spatial branching model are different in 
nature. In the branching model we have growth of the total type-2 mass at exponential rate s, which 
is due to essentially a random number of families growing at that rate and hence with few sites 
with a large population. In the Fisher- Wright case we get an exponentially ( rate a < s) growing 
number of sites with a type-2 population of at least e. Macroscopic emergence occurs once this 
growing droplet has volume const ■ N . 

In order to understand the structure of the quantity W* arising from (|1.74p . we have to inves- 
tigate the behaviour also in the earlier time horizon t e [0, Tq] of 2^'"^ as N oo. And addressing 
our point (3) from above (|1.70p we obtain the following eonvergenee result. 

Proposition 1.10 (Limiting droplet dynamic) 
As N ^ oo 

(1.76) £[(Jf ■")*>o] =^ C[OT)t>o] 

JY->oc. 

in the sense of convergence of continuous A^a([0, l\)-valued processes where A^a([0, 1]) is equipped 
with the weak atomic topology. □ 

In the growth behaviour of the limit dynamics as iV — > oo we obtained above, we recover 
the quantity a* and W* in the longtime behaviour of the limit dynamic (Il™)t>o in the following 
proposition. 

Proposition 1.11 (Long-time growth behaviour of 2]^) 

Assume that either m > or IIq([0, 1]) > 0. Then the following growth behaviour ofl"^ holds, 
(a) There exists a* such that the following limit exists 

(1.77) lime-"**i?pr([0,l])]e(0,oo). 
We have 

(1.78) a* = a, 
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where a is defined as in Proposition \L7\ (see also \3.lJ^) . 

(b) Recall [TT^ for the law ofW*. Then: 

(1.79) e-"*3™([0, 1]) =^ W* in probability. 

(c) The growth factor in the exponential is truly random: 

(1.80) < Var[ lim e^"*Jr([0, 1])] < oo. □ 

Remark 8 We shall see in the proof that the random variable W* reflects the growth o/II™([0, 1]) 
in the beginning, as is the case in a supercritical branching process and hence E* = logW* can 
be viewed as the random time shift of that exponential e"* which matches the total mass of 1™ for 
large t. 

Given 3™, let vt{o,,h\, < a < & < 1 denote the number of atoms in the interval (a, 6]. Then 
the analogue of the size distribution for the discrete CM J process is given by 

R is reasonable to expect that there is a limiting stable size distribution in the t — > cx) limit similar 
to that in the discrete case but we do not follow-up on this here. 

Part 3: Fixation 

We now understand the preemergence situation and the time of emergence. In order to continue 
the program to describe the dynamics of macroscopic fixation, we consider the hmiting distributions 
of the empirical measure-valued processes in a second time scale log + t, < e K. Define 

1 " 

(1.82) S;^^'"(t):=-5:<5(,«(,iH^+,),,«(,,i,^^,)), teM, (s'°«-(t) e 7'(7'(K))) 

1=1 

and then the two empirical marginals are given as 

N 

N ■ 

i=l 

Note that for each t and given £ the latter is a random measure on [0, 1]. Furthermore we have the 
representation of the empirical mean of type 2 as follows: 

(1.84) xf(i^+t)=/ xE'^^^-{t,2)idx). 

a J[o,i] 

The third main result of this section is on the fixation process in type 2, saying that 5^^^'" 
converges as TV — > cxj and the limit can be explicitly identified as a random McKean-Vlasov 
entrance law starting from time —oo in type 1. 

Proposition 1.12 (Asymptotic macroscopic fixation process) 

(a) For each — oo < t < oo the empirical measures converge weakly to a random measure: 

(1.85) C[{E%>^'''{t,i)}]^^C[{Ct{l)}]=PleV{V{[0,l])), for £^1,2. 
In addition we have path convergence: 



1 " 

(1.83) 5'^«^"(t,^):=-^<5,„(, i^+,), ^=l,2andt€]R, (S^s^"(i, £) e 7'([0, 1])). 



(1.86) w-liiRC[{E'°>'-"{t))tm] =Pe V[C{{~oo,oo),V{V{K)))]. 
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A realization of P is denoted {Ct)ti£R respectively its marginal processes (Ct{l))teR, {^t{'2))teR- 

(h) The laws Pl,i — 1,2 are non-degenerate probability measures on the space V{[0,1]). In 
particular 



(1.87) E 



xCtl2){dx) 

[0,1] 



xCt{2){dx)] 



(c) The process {Ct)t<£m describes the emergence and fixation dynamics, that is, for t G M, and 
e > 0, 

(1.88) lim Pro6[£t(2)((e, 1]) > e] 0, 

(1.89) lim Prob[Ct{2){[l - e, 1]) < 1 - e] = 0, 
with 

(1.90) Ct{2){{0,l)) > , VteM, a.s.. 

(d) For every t gW, always both type 1 and type 2 are present: 

(1.91) Prob[Ct{2)m) = 0)] = 1. 

(e) The limiting dynamic in U.86]) is identified as follows: 

The probability measure P in \1.86]) is such that the canonical process is a random solution 
(recall Definition and entrance law from time — oo to the McKean-Vlasov equation 111.18]} . 

(f) The limiting dynamic in U.86]) satisfies with a as in ^1.64^ : 

(1.92) /:[e"l*l / xCt{2){dx)]^ C[*W] as t^ -oo, 

and we explicitly identify the random element generating P in il.86]) . namely P arises from random 
.shift of a deterministic path: 

(1.93) P = C[t.£C*] , *£ = (log* W)/a, is the time-shift of path by r, 

where C* is the unique and deterministic entrance law of the McKean-Vlasov equation 111.18]} 
satisfying \1.29j} and with projection Ct(2) on the type 2 coordinate satisfying: 

(1.94) e"l*l / xLt{2){dx) — >\,ast^-oo. 

J[o,i] 

The random variable *W satisfies 

(1.95) < *W < oo a.s., E[*W] < oo, < Var{*}V) < oo. 

(g) We have for sat -> oo with sn ~ o{logN) the approximation property for the growth 
behaviour of the limit dynamic by the finite N model, namely: 

(1.96) £[e"^"x?'(i^ - sn)] =^ C[*W]. □ 

a N^oo 

Corollary 1.13 (Scaled total mass process convergence) 
The above implies for the total mass process that: 

(1.97) C[{x^i^^^ + t)}tm] C[{ I xCt{2){dx)} 

a jv^oo 



[0,1] 



and we have convergence in distribution on path space. □ 
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Remark 9 The random variable W* of {l-l-j^ can be viewed as the final value of the microscopic 
development of the mutant droplets and *yV of U.92\) can be viewed as an entrance value of the 
macroscopic emergence of the mutant type. 

This result of course immediately raises the question how the macroscopic entrance from is 
related to the final value in the smaller original time scale. We have already stated that the two 
exponential growth rates from these two directions are the same. But more is true also the two 
growth constants have the same law: 

Proposition 1.14 (Relation between microscopic and macroscopic regimes) 
The entrance value and the final value are equal in law: 

(1.98) C[W*] = C[*W]. □ 

Remark 10 We note that this result cannot be derived from some soft argument since we relate 
two time scales t and logN + t which separate as N ^ oo and indeed the limits N oo and 
>-oo or t ^ —oo cannot be interchanged in general. 

Remark 11 One could go further studying the relation between final and entrance configuration 
and ask how the limiting Palm distribution fT^, which describes the typical sparse type-2 sites is 
related to the limit C-oo as t ^ —oo of the Palm distribution Ct of Ct, i.e. 

(1.99) A(2)(dx)- ^^*(2)(dx) 



JxCt{2){dxy 

which describes the typical site "immediately" after emergence. 

Note that this new transformed measure is now deterministic in the t — > — oo limit, since in the 
ratio the same random time shift is used in both numerator and denominator. 

Here we have to observe that due to the exponential growth of type-2 sites the state depends on 
when they were colonised and that time point is typically a finite random time back. Define for the 
process given in hl.33\) 

(1.100) ^*{dx) — probability that an excursion of age s has size in dx. 

Note that for any time the total mass 3t([0,l]) is finite but there are countably many non-zero 
atoms in 2t. At time t let 

(1.101) ^ti[ci, b)) ~ number of atoms of age m [a, 6), < a < < 1 

and then define the corresponding size-biased distribution on the product space x [0, 1] of age 
and size given by 

(1.102) Uds.dx)= , ^^(d^Y^(dx) 

lo ^tids)!^ x^il{dx)ds 

By analogy with the Crump- Mode- J agers theory, we would expect that this age and size distribution 
Tt{ds,dx) stabilizes as t ^ —oo with stable age and size distribution T{ds,dx). 
Then we would expect to have the relation: 

(1.103) £_oo(2)(da;) = r((0, oo),dx). 



Corresponding to the limit dynamics of Proposition II . 121 we also have a stochastic propagation 
of chaos result, which characterizes the asymptotic behaviour of a tagged sample of sites as follows: 
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Proposition 1.15 (Fixation dynamic for tagged sites) 

There is a unique limiting entrance law for the process of tagged components: for fixed L G N, 
the sequence in N of laws 

(1.104) m^2{h ^ + m,=^,....L)tm] 

a 

converges as N ^ oo weakly to a law 

(1.105) C[{X2{t))t>o] = C[{{x2il,t), ■ ■ ■,X2{L,t))tm], with X2{t) as t ^ -oo, 
which is if restricted to t >to for every to the weak solution of the SSDE: 



(1.106) dx2{i,t) =c{ X Cti2){dx) ~ X2ii,t) \ dt + sx2ii,t){l- X2ii,t))dt 

VM J 



+ ^y d ■ X2{i, t){l — X2{i, t)) dwi{t), i — l,...,L, 



where wi, . . . , wl are independent standard Brownian motions which are also independent of the 
process {Ct\teK and the latter is given by il.93\) . The behaviour at — oo in hi. 105]) and the equation 
il.l06]) uniquely determine the law for a given mean curve. □ 

Remark 12 Note that X2{i,t) will hit at random times until X2(t) exceeds a certain level de- 
pending on the other parameters in particular c/d, only after that time where X2{t) remains large 
enough the component remains a.s. occupied with positive type-2 mass for all future time. Before 
that time the path will have zeros at an uncountable set of time points with Lebesgue measure zero, 
the set of these zeros will be stochastically bounded from above. This is a consequence of standard 
diffusion theory (see \RW^ ). 

2 Duality 

The key tool to obtain our results is duality between our process and a particle system. 

A duality relation for a Markov process Z with state space E is given as follows. There is a 
process Z' with state space E' such that for a function H : E x E' R, which is bounded and 
continuous we have 

(2.1) Ez„ [H{Zt,Z'o)] = Ezi^ [HiZo, Z[],t > 0. 

The duality we use is a special case of the duality theory developed in jPGsel] and |DGDuality| . 
We begin constructing the dual particle system and then state the duality. 

2.1 The dual process 

We construct now a dual process. This dual process is based on a particle system {r]t)t>o-: whose 
dependence on N we suppress in the notation. 

We need the following ingredients to define the process rj: 

• A^t is a non-decreasing N-valued process. Nt is the number of individuals present in the dual 
process with Nq = n, the number of initially tagged individuals, and with Nt — Nq > given 
by the number of individuals born during the interval (0,i]. 

• (^t ~ {I7 ■ ■ ■ 1 At} is an ordered particle system where the individuals are given an assigned 
order and the remaining particles are ordered by time of birth. 
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Vt = (Cti'^'tiCt) is a- trivariatc process consisting of the above C and 

— TTj: partition (tTj , • • • ,7r|'^'') of Ct, i.e. an ordered family of subsets, where the 

index of a partition element is the smallest element (in the ordering of individuals) 
of the partition element. 

~(,t ■ T^t — > ^Iv*'' S^^ii^g locations of the partition elements. Here is the vector 
of the prescribed space points where the initially tagged particles sit. 

• J^t is for given rjt = (Cti7rt,^t) is a function in Loo(l''^'')- 
Therefore the state of i] is an element in the set 

oo 

(2.2) S= IJ {1, 2, • • • , m} X Part<({l, • • • , m}) x : Part<({l, • • • , m}) — > ^n}, 

rn—l 

where Part<(A) denotes the set of ordered partitions of a set A C N and the set {1, • • • , m} is 
equipped with the natural order. 

It is often convenient to order the partition elements according to their indices 

(2.3) index (ttJ) = min{k G N : fc G ttJ) 
and then assign them ordered 

(2.4) /a&e/s 1,2,3, •••,|7r(|. 
Remark 13 Note that 

(2.5) 7V, = Ki| + ... + kl-l|. 

Remark 14 The process rj can actually be defined starting with countably many individuals 
located in VL^- This is due to the quadratic death rate mechanism at each site which implies that 
the number of individuals at any site will have jumped down into N by time t for any t > 0. 

The dynamics of {rjt} is that of a pure Markov jump process with the following transition 
mechanisms which correspond to resampling, migration and selection in the original model (in this 
order): 

• Coalescence of partition elements: any pair of partition elements which are at the same site 
coalesce at a constant rate; the resulting enlarged partition element is given the smaller of 
the indices of the coalescing elements as index and the remaining elements are reordered to 
preserve their original order. 

• Migration of partition elements: for j € {1, 2, . . . , |7r(|}, the partition element j can migrate 
after an exponential waiting time to another site in fi^v which means we have a jump of ^t(j). 

• Birth of new individuals by each partition element after an exponential waiting time: if there 
are currently Nt individuals then a newly born individual is given index 7Vt_ + 1 and it forms 
a partition element consisting of one individual and this partition clement is given a label 
|7rf_| + 1. Its location mark is the same as that of its parent individual at the time of birth, 

• Independence: all the transitions and waiting times described in the previous points occur 
independently of each other. 
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Thus the process {rit)t>o has the form 

(2.6) vt^iCt, ^t,(et(i),6(2),---,6(kt|))), 

where ^t{j) G ^n, j = l,---,|7rt|, and ttj is an ordered partition (ordered tuple of subsets) of 
the current basic set of the form {1, 2, . . . , Nt} where Nt also grows as a random process, more 
precisely as a pure birth process. 

We order partition elements by their smallest elements and then label them by 1,2,3,... in 
increasing order. Every partition element (i.e. the one with label i) has at every time a location 
in 17 TV, namely the i-th partition element has location ^t(«). (The interpretation is that we have 
Nt individuals which are grouped in |7rt|-subscts and each of these subsets has a position in JIat). 

Furthermore denote by 

(2.7) 7rt(l),...,^t(|7rt|) 

the index of the first, second etc. partition element. In other words the map gives the index of a 
partition element (the smallest individual number it contains) of a label which specifies its current 
rank in the order. 

For our concrete situation we need to specify in addition the parameters appearing in the above 
description, this means that we define the dual as follows. 

Definition 2.1 ( The dual particle system: rjf) 

(a) The initial state ijq is of the form (Coi'^Oj^a) with: 

(2.8) Co = {l,2,---,n}, ^0 = {{!}, •••,{"}} 

(2.9) Co € {^nT- 

(h) The evolution of (rjt) is defined as follows: u 

(i) each pair of partition elements which occupy the same site in Qn coalesces during the joint 
occupancy of a location into one partition element after a rate d exponential waiting time, 

(ii) every partition element performs, independent of the other partition elements, a continuous 
time random walk on flj^ with transition rates ca(-, •), with a{^,^') ~ a(C'iC)j 

(Hi) after a rate s exponential waiting time each partition element gives birth to a new particle 
which forms a new (single particle) partition element at its location, and this new partition 
element is given as label \7Tt- \ + 1 and the new particle the index Nt- + 1. 

All the above exponential waiting times are independent of each other. □ 

Note that this process is well-defined since the total number of partition elements is stochastically 
dominated by an ordinary linear birth process which is well-defined for all times. 



2.2 The duality relation 

This gives in our case (see |DGDualTty ) a duality relation between given in (|l.ip and given 
by dm). 

The duality relation reads in our case for ii, • • • , i„ e {1, • • • , N} all different: 



n „ 

(2.10) E[l[ xfi(t)] = E[expi^^ / n^-'^dz.)], 
fc=i { 
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where IIu denotes the number of dual particles at time u starting with the initial state (|2.8p . 

Wc also need the duality relation for the McKean Vlasov process {{xf{0,t),xf{*,t)), £ = 
1, 2} which is given by xi = 1 — 0:2 and 

(2.11) dx^iO.t) ^ c{e ~ x^{0,t))dt +s{xf'{0,t){l-x^{0,t))dt 

+ ^/d■x^iO,t){l - xf{0,t)dw{t) 

(2.12) xT{*,t) = e. 

The dual process for the McKean- Vlasov processes arises if we replace 

(2.13) {l,---,iV} by {0,*} 

such that all rates of the dual particle on * are put equal to zero and the jump at rate c from is 
always to {*}. Otherwise the system remains. 
The duality relation now reads: 

(2.14) E[{x^{t,O))'']^E[0\^^'''\]. 

3 Proof of results 

We begin in Section [2] to state the duality theory and continue in Subsubsections 13.1113.61 with the 
proofs. We start in Subsection 11.191 with a warm-up exhibiting some of the methods in simpler 
cases. (This is not needed for the rigorous argument). Then we continue with the formal proofs, 
namely proving 

• in Subsection 13.21 assertions related to the large time scale, in particular Proposition II. 3[ 
Proposition 1 1 . 71 and Proposition ll.121 

• in Subsection 13.31 assertions concerning the small time scale and droplet formations, in par- 
ticular Proposition II. 5[ Proposition 11.91 and Proposition II. 101 

• in Subsections 13.41 and 13.51 assertions concerning the connection between the two time scales, 
in particular Proposition 1 1 . 14l 

• in Subsections 13.6113.71 additional assertions where different methods are used, in particular 
Proposition 1 1 . 1 51 in Subsection 13.61 and extensions in Subsection 13.71 

3.1 A warm-up: The case of a single site model 

Before we start proving all the statements made in Subsection II .31 we look at some simpler cases 
and already give in the arguments a flavour of the methods used later on. This subsection may be 
skipped from a purely logical point of view. These simpler systems are single site models. 

In the absence of migration, c = 0, the individual sites evolve independently. In this case it 
suffices to study a system 

(3.1) X{t) = {xiit),X2it)) 

at a single isolated site, for example use xi{t) = xi{l,t), X2{t) = X2{l,t). Recall (|l.llll.6p . 

We begin by showing in this case that emergence in log N time scale does occur in the infinite 
population, that is, deterministic case, i.e. d = but does not occur in the finite population case. 
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that is, random case, where d > 0. This demonstrates the effect of the bounded component (mass 
restriction) and the important role played by space and migration. 

The proof of this result gives us the opportunity to introduce some basic ideas and methods in 
a simple setting. We consider here successively the deterministic and stochastic case and collect 
the proofs with the dual calculations in the third Subsubsection 13.1.31 

3.1.1 Emergence of rare mutants in the single site deterministic population model 

We first consider the deterministic, i.e. d ~ (often called infinite population) model without 
migration, i.e. c — 0. In the case of small mutation rate, ^, we will show with two different 
methods that asymptotically as iV — > oo emergence occurs at times l££-^ + t,t ^ R with fixation 
as t oo: 

Lemma 3.1 Assume that the mutation rate is and d ~ 0, s > 0, c = 0. Then we have: 

3.2 hm X2 +t] =1 <^ n ^ n 

N^oo \ s J s + me** l_ 0, as t ^ -oo. 

Remark 15 This means that in the deterministic case we have emergence and fixation in the 
"critical" time scale {s^^log N) +t as in the branching approximation. 

3.1.2 Emergence of rare mutants in the single site stochastic population model 

The above analysis can easily be extended to the stochastic case (often called finite population), 
that is, d > 0, but now we get a qualitatively different conclusion. We again take mutation rate 
^. Namely we observe that 

Lemma 3.2 Let d > and c = 0, then 

(3.3) C[x2 (aw)] Sq whenever a^/N — > as N oo. 

iV— >oo 

Therefore emergence and takeover does not occur in time scale log N if there is no migration, that 
is, c = 0. □ 

Remark 16 This means that the "finite population sampling", i.e. d > 0, seriously slows down 
the emergence of rare advantageous mutants. This agrees with the corresponding result in the 
branching case where the fluctuation term is responsible for the emergence on 0{N) scale if c = 0. 
This effect is removed as soon as we have migration, which is therefore the key mechanism for the 
emergence if d > 0. 

3.1.3 Proofs using the dual 

Proof of Lemma 13.11 The Lemma IXTl is now proved in two ways with and without duality. We 
consider first an auxiliary object X2(t) for which if we consider the case where the parameter m is 
replaced by ^ we get our x^. 

(i) Analytic approach The quantity X2{t) satisfies the ODE 

dx'? 

(3.4) ^ m{l - X2) + 3X2(1 - X2y, x(0) = 
with solution 

m(e("+")* - 1) 



(3.5) X2it) = 



s + me(™+'*)* 



3 PROOF OF RESULTS 



27 



We can now study this cxphcit solution to the ODE for our choices of the parameters and the 
appropriate time scales where t is replaced by i log + t. (See point (iii) below). 

(a) Duality approach. As a prelude to the analysis of the stochastic case we show how the 
formula (|3.5p can be obtained using the dual representation. 

We assume that Yq = 5i, s > and we choose /o = l{i}- The mutation part of the dual consists 
of the simple transition corresponding to a mutation from type 1 to type 2. This means for the 
dual that we have jumps 1{2} 1, l{i} ~> both at rate m. 

Let {pk{t)}kez+ be the distribution of a pure birth process Nt starting with one individual (and 
each individual independently gives birth at rate s at time t) and let 

(3.6) T = time of first mutation jump. 

Let Pk{t) denote the probability that there were (fc — 1) births in [0,t] and that no mutation 
jump has occurred. 

Then the random state of at time t is given by: 

(3.7) ^^Tt = (l{i})®'=l{.>t}, 

with probability Pk(t) and with probability 1 — ^Y^=\'Pk(f)- Then 

(3.8) xi(i) = E\< X{t), >]=E[J ""''T+dXt] ^E[j ^^F+dX®^^'^] = P[r > t]. 
We have: 

oo 

(3.9) P{T>t)=Y,pk{t). 

k=l 

Noting that the rate of a jump to zero is mn when there are n factors it follows that the 
collection {p„(t), n S N} satisfies the system of ODE's: 

(3.10) %^ = (71 - l)s p„_i(i) - n(s + m)p„(0 

dt 

Pi{0) = 1. 

To determine the {pk{t), fc £ N} we introduce the Laplace transform: 



(3.11) i?(0,O = 5^e"%(i). 



n=l 



Then using p.lOp it can be verified that R{-, •) satisfies 
(3.12) R{e,0) =e^ 

= ^[se"«(n - l)p„-i(0 - (s + m) ^ ne"V(i)] 



dR{e,t) 



dt 



n=l 



= se'—Rie,t)-is + m)—Rie,t) 

= [se'-{s + m))^^R{e,t). 

The solution to this PDE is obtained by the method of characteristics as follows. In symbolic 
notation: 
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so that wc obtain the characteristic curve 

(3.14) e(^+")*(s - (s + 711)6-") = constant. 
Hence the general solution is 

(3.15) R{0, t) = *(e(-^+")*(s - (s + m)e-'^)), 

where ^ is an arbitrary function. Using the initial condition 

(3.16) e'^ = ^{s- {s + m)e-'^) 
we get that 

s + m 

3.17 * w = ■ 

s — u 

It follows that 

(^■^^^ ^(^'*^=.-e(«+™)*(st-7s + m)e-«)- 
Therefore 

s + m 



(3.19) 1-X2(i) = =^(0'*) 



('iiij Large N asymptotic. We now consider the emergence of a rare mutant with m replaced 
by ^ and consider the time scale (C log N) +t, t G M. Wc get 

o _i_ m £. _j_ m 

(3.20) l-a;2(ClogA^ + <) 



so that 1 — 2:2 (C log + 1) ^ 1 or for all t G M depending on C < ^ or C > - . In other words 
the rare mutant emerges and takes over in time scale logN as — >■ 00. We shall derive more 
precise results later in Subsection 13.31 for example in p.647p . Finally, for C — ^, 

s + — s 

(3.21) l-x2(ClogA^ + <) = — ^-^§—^^— asTV^cx). 

Proof of Lemma 13.21 

Wc consider the dual process [rj^ ,FKF^)t>o and define 

(3.22) Tjv = inf{i Ft = 0}. 

For the deterministic case, d = 0, the random time tn was of order log N , but this now changes 
to a much larger order of magnitude. This is made precise in the following Lemma that together 
with the dual identity completes the proof of Lemma 13.21 

Lemma 3.3 Let d> {) and c — Q. 
//lim^^ ^ 0, then 

(3.23) P{tn < On) as N ^ 00 for all t > 0. □ 
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Proof of Lemma [ 

Start the dual J-^ in l^xj. In this case up to the time of the first mutation the number of 
factors in p. 71) is equal to or greater than 1. Therefore for each N, P{tn < oo) ~ 1. 
This implies that for every fixed N 

(3.24) x^it) 1, a.s. as t ^ oo. 

On the other hand the number of factors is stochastically bounded due to the coalescence (which 
occurs at quadratic rate as soon as d > 0). To verify this we can calculate the expected number of 
factors and show that it remains bounded by obtaining an upper bound given by the solution of 
an ODE as follows. 

Consider the birth and death process Ifj"' = representing the number of factors of 

which consists of at time t if we are starting at Hq"^ = ti at one site. The dynamics of this 
object follows immediately from the evolution of {J-^)t>o. The forward Kolmogorov equations of 
the process are therefore given by 

(3.25) p'y(i) = sij - l)p.,,-i(i) + ^{j + l).mj+i(t) - (sj + ^jU - l))Kj(t), 

(3.26) PnAO) = 1- 

Letting _E[n|"']'^ ~ ^^{t) ~ J2'jLa .i''Pn.j{t), we obtain with s = s + |: 

(3.27) —^^smi{t)--m2it)<smi{t)--ml{t), mi{0) ^ n. 
Therefore 

(3.28) E[Il["^] < 



nd{e^* - 1) + 2s ' 
so that 

(3.29) E[J^ ni")^ < ^ log(^ + n(e^* - 1)). 

This implies that if we let 1 > e > and choose a so that 1 — e^" = e, then using p.29p we 
get: 



(3.30) P[tn < ajv] < e + Pi"^ I " n(")rfs > a] < e 

^ Jo 



2m const + [logn + sajv] 



d-a N 

Hence limAr_j.oo P{tn < oln) < £ for arbitrary £ > and the proof is complete. 



3.2 Proofs of Propositions [HI, [EH and [rT2l 

Li this section we consider the large time scale log + t, t G M and prove Proposition 11.31 
on properties of the limit dynamics in this emergence and fixation time scale and the convergence 
results stated in Subsubseetion [T.3.3[ in particular, on the time needed for emergence and the form 
of the process of fixation, i.e. Propositions 11.71 11.121 (Recall the remaining assertions requiring 
analysis in different time scales, in particular Propositions 11.91 and 11.111 on droplet formation are 
proved later in Subsection 13.31 and Proposition 11.151 on tagged sample convergence which uses 
different methods will be proved in Subsection 13.61 ) 
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3.2.1 Outline of the strategy of proofs for the asymptotic analysis 

The proofs for the large time scale proceeds in altogether thirteen subsubsections. Therefore we 
describe here in 13.2.11 first the general strategy and list at the end what happens in the various 
subsubsections. 

The proofs of the results on the — > oo asymptotics for the process in large time scales involve 
calculating the mean of (t) or a;^ (t) respectively of higher moments and hence depend in an 
essential way on an asymptotic analysis of the dual process which we introduced in Section [21 

In the case of two types the required dual calculation involves a system of particles (correspond- 
ing to factors) located at sites in {1,...,A^} with linear birth rate and quadratic death rate at 
each site and migration between sites. Assuming that the initial population consists of only one 
individual and the initial function / is the indicator of type 1 at site 1, then we get a growing 
string of factors of such indicators, until the rare mutation operator (at rate ^) acts on it at which 
time it and therefore the product becomes zero. Hence the main point is to prove that a mutation 
event somewhere in the string will occur in the log N time scale. For this to be the case we have 
to prove that the total number of factors reaches 0{N) in that time scale. 

Consider now the dual particle system rj starting with only finitely many particles. As long as 
the number of occupied sites remains o{N) the migration of individuals leads in the limit iV — > oo 
always to an unoccupied site. More precisely, in this regime the proportion of sites at which a 
collision occurs, that is a migration to a previously occupied site, is negligible. For that reason 
the collection of occupied sites can be viewed (asymptotically as — > oo) as a Crump-Mode- 
Jagers branching process (a notion we shall recall later on). This process grows exponentially 
and the identification of the emergence time corresponds to the Malthusian parameter a of this 
CMJ-branching process. 

In order to investigate the dynamics in the next phase between emergence and fixation, i.e. in 
times ClogA^ + t with i g R the new time parameter, we must on the side of the dual particle 
system now consider the role of collisions of dual particles once the number of occupied sites is 
of order 0{N) and the proportion of occupied sites at which collisions occur is asymptotically 
non-zero. This can be handled by identifying a dynamical law of large numbers described by a 
nonlinear evolution equation. Nonlinear due to the immigration term at a site, resulting from 
immigration from other sites and which in the limit depends on their law. On the side of the 
McKean-Vlasov limit, this makes the evolution of the frequency of type 2 at single site depend on 
a global signal, namely the current spatial intensity of type 2. 

The delicate issue of the transition between these two regimes with and without collision must 
be addressed to fully describe the emergence and to show that this leads to the McKean-Vlasov 
dynamics with random initial condition. For that purpose the system is considered in the time 
scale \ogN + t, with t gM. the macroscopic time parameter. The key element in the analysis is 
the determination of the leading term in an asymptotic expansion of the type-2 mass in time scale 
logiV -I- t with t varying, once we have taken TV — >■ oo, as t — oo. On the side of the dual 
process this will correspond to an expansion of the normalised number of occupied sites as ^ oo 
and then t — >■ oo in the above time scale. However also in the discussion on droplet formation later 
on in subsection 13.31 it is necessary to keep track of the collisions since we need higher-order terms 
in the expansion of the total type-2 mass and these terms involve collisions of dual particles. This 
will be dealt with in Subsubsection 1373 . 5 1 and the sequel. 

To carry out this program we must study the behaviour of the dual particle system in two time 
regimes, namely. 



(3.31) s{N) + t where s{N) oo but s{N) = o(log N) 



3 PROOF OF RESULTS 



31 



and for suitably chosen a, 
log N 

(3.32) — \-t, -oo<t<oo, 

a 

and their asyniptotics as — > oo. The first we call the collision-free regime or pre-emergence 
regime, the second the collision regime. 

A key step in the dual process analysis is a coupling of the pre-emergence and collision regimes 
in order to understand the transition between the two regimes and to be able to consider the time 
scale 

(3.33) - log ^ + t7v with tAT ^ -cx), t at = o(log A^) . 
a 



Outline of Subsection 13.21 

This subsection focusscs on the proofs of Proposition 11.31 (Subsubscction [372.ip on the limiting 
McKean-Vlasov dynamics, Proposition 11.71 (Subsubscctions [5.2.11 - IX^^ and Proposition 1 1 . 1 2 1 on 
emergence respectively convergence to the limiting dynamics of fixation (Subsubscctions 13.2.71 

MM- 

The Subsubsection [572.31 is devoted to the proof of Proposition 1 1.3[ which specifies the limiting 
laws in short and large time scales. 

In Subsubscctions l3.2.4l[3T276l we begin the proof of Proposition 11.71 on the order of magnitude 
of the emergence time. In a first step we exhibit the branching structure for the number of sites 
in 13.2.31 and then analyse the collision free, respectively the collision regime. [3.2.41 to [3.2.61 

The proof of Proposition 11.121 on the limit dynamics is more involved and is broken in six 
preparatory steps which are entirely concerned with the dual process at times ^ log N + 1 and a 
final argument returning to our process, each of these is in one of the separate Subsubscctions 
[3X7)13.2.131 Namely, we study in Subsubscctions [3X71 [3X81 [3X9l and 13 . 2 .101 the structure of the 
dual process more carefully, show in 13.2.111 convergence of the process to the limiting dynamics 
and examine in Subsubsection [5.2.121 the random growth constant in the emergence. Then we are 
able in Subsubsection [372 . 13! to complete the proof of Proposition [T7T2| on the limiting dynamics of 
fixation. 

3.2.2 Proof of Proposition 11.31 (Properties McKean-Vlasov dynamic) 

In order to prove the results on the McKean-Vlasov dynamic recall the duality in Section |2| We 
now go through the different parts of the proposition to prove, but here the hard part is assertions 
d and e. 

(a) This is the wellknown McKean-Vlasov limit for the interacting Fleming- Viot model with 
migration and selection (compare, for example, |DG99| . Theorem 9). 

(b) We want to show that E[xi{t)] — > as i — )- oo if i?[a;i(0)] < 1. This follows by a simple 
application of the mean-field dual {rjt, J-f^)t>o started with J-q = Ij^} and ijq having one particle 
located at site 0. As time proceeds in the dual process the selection operator keeps being applied. 
But the ^-fold application of the selection operator, which now in the two-type case is simply 
obtained as / — >■ f(Si l{i}; produces as surviving terms for J^^ a string of the form (note coalescence 
only changes here the number of factors belonging to distinct variables) 

(3.34) nf2lMi}{uut)) 

and since the total number of factors diverges as soon as the migration rate c is positive, i.e. 
£{t) — )■ oo, the result follows from the assumption i?[a;i(0)] < 1. 
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(c) For existence it suffices to obtain a particular solution (£( (2))tgR where Cq{2) £ P([0, 1]). 
Take the solution (£"(2))t>o with Cq{2) = /i„ with 

(3.35) an := / x/i„((ia;), a„ — > as rt — > oo. 

Jo 

In particular we can take here = 6a„- Choose now (a„)„gN with < a„ < i and a„ — > as 
71 — > oo. Then first of all for every fixed n 

(3.36) lim Q(2) = 5^ 

1 _ 

and there exists r(a„) such that J xC^f^^ ^{2)dx 

increasing and continuous in a. 
Then define 

(3.37) (£"(2))4>_,.(a„) = (£"+^(„^)(2))t>_r(a„)- 

Since for any i > 0, 

(3.38) lim C'l{2) = Sq, 

n— >cjo 

it follows again by duality that lim„_>.oo r{an) = oo. The value at t = — r(a„) tends to Sq as n ^ oo. 

We note that if for a subsequence {nk)k&'i the sequence (£"'')fcgN converges as fc — > oo for some 
s S R, then it converges for all times t > s, as we immediately read off from the duality which 
gives the Feller property (continuity in the initial state) of the McKean-Vlasov process. Therefore 
by considering for example s = —k,k € N we can obtain a further subsequence converging for all 
times t e IR. Now we take such a convergent subsequence of (£"(2))„gN and define £*(2) as the 
(71 — 7> 00) limit. By construction / xCQ{2){dx) = ^ and therefore Cf{2) — > (5i as t — > 00. It remains 
therefore to argue that C*{2) is a solution and is converging to (5o as i — ?> —00. 

The fact that we obtain a solution of the equation is a consequence of the construction together 
with the continuity in the initial state which is immediate using the duality. Namely for every n 
we have a solution for t > —T{an)- Then using duality we see that the limit is a solution as well, 
since it satisfies the duality relation between times s < t; s,t G M.. 

The convergence to Jo as t — > — 00 follows by contradiction as follows. Assume we converge to a 
strictly positive mean. Then applying the duality to calculate the mean at time starting at times 
t 4, —00 we see, that this mean converges to one and therefore we obtain convergence to Si (by 
p.36p ) at time 0. This contradicts the assumption that the mean is ^ at time 0. This completes 
the proof of existence of a solution on (—00, 00) satisfying l|1.26p . 

(d, e) Here we proceed in three steps, first we establish existence of solutions whose mean curve 
has the claimed growth behaviour at —00 and then we show uniqueness of the mean curve with 
this growth behaviour and subsequently the uniqueness of the solution for given mean curve. 

Step 1 Growth of mean-curves 

The first step is to consider the growth of solutions of the McKean-Vlasov equation, Ct{2), at 
—00. We will show two things, 

• first there exists a solution with mean curve m(-) which behaves like m{t) Ae"* for t — >■ —00 
with A G (0,00), 

• secondly that all solutions satisfying the growth condition posed in (d) have mean curves 
satisfying 77i(t) ~ Ae"* as t ^ —00 for some A £ [0,oo). 



= i. In fact by duality we see that r(a) is strictly 
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In fact wc shall combine these two points in our proof. 

First assume that (£t(2))tgR is a solution (and entrance law) with mean-curve {m{t))tes, which 
satisfies: 

(3.39) m{t) > 0, < liminf e-"*m(i) < oo. 

> — oo 

We want to show that e~"*m(t) converges to some A G M^. 

We can chose a sequence i„ with t„ —oo so that for some A € (0, oo) we have 

(3.40) e"l*"lm(i„) = e"l*"l / x Ct„ (2) {dx) A as n oo. 

Jq 

We now abbreviate 

(3.41) ^t„:=£t„(2). 

What we know about fit^ is its mean and from this we have to draw all conclusions on the 
convergence, namely that the limit A does not depend on the choice of (t„)„GN- 

On the other hand we can start with such a measure iJ,t„ as in p.4ip and try to construct a 
solution with a mean-curve {m{t))t£R, m{t) = {A + o(l))e"* as t — )• — oo as the limit point of a 
sequence of solutions starting at time t„ in ^t-^ ■ A solution starting in /it^ at time is denoted: 

(3.42) {Ci)t>t^. 

Then we know, that if (£t(2))igR is an entrance law with Ct^ = Ht„, then by uniqueness of the 
solution starting at t,i we must have 

(3.43) /:t(2) = C(2), fort>i„. 

Therefore our strategy now is twofold. First we verify that in fact the mean curve (m(t))fgK of 
(£t(2))tgR has the form: 

(3.44) m{t) = exp{a{t+-\ogA))+o{e°'*). 

a 

Then we argue that in fact the (£")t>t^ with mean curves satisfying p.40[) converge to an entrance 
law with mean curve satisfying p.44p . We begin with the first point and come to the second one 
(which is then simpler by what we have prepared for the first point) after p.88p . 

In order to carry this program out we must consider the different possibilities we have for 
Ait„ e V{[0, 1]) with mean curve Tn{t„) ^ e^T) ^here we use the abbreviation 

(3.45) e(n) = e"l*"l. 

This is somewhat complicated by the possible interplay of probabilities and sizes in producing a 
given mean. 

We begin by considering two special cases that illustrate the main idea. Let 

(3.46) ^,^=(i-^)S^ + ^Sa,, 

e[n) e(n) 

so that A = ai +pia2. We can distinguish now different cases. The first case arises in our example 
if (small probability for an order one value 02) 

(3.47) pi = 0, ai > 0, 
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in which case the mean curve satisfies 

(3.48) e{n) J x^t„dx — > ai, 

the second case arises if (large probability for very small values) 

(3.49) pi = 1, fli = 0,02 > 0, 

in which case the mean-curve satisfies 

(3.50) e(n) / xfit^dx — ^ const 



given in p.61|) . 

We shall now consider these two cases each with a distinct flavour which then allows us, since 
they catch the key features, to see how we can treat the general case, which we call case 3 and for 
which we need to use an additional trick. 

A key tool in all cases is the duality relation and the fact that the dual process consists of a 
collection of birth and death processes acting independently so that with generating functions we 
can analyse the dual expectation. 

Case 1 

Here we first treat the deterministic case fj,t„ = S °i . We will use the dual representation for 

the McKean-Vlasov limit together with p.l43|) (giving below Wn ^ W) to compute the expected 
value of X2 mass at time t, i.e. the mean of Ct{2). Note that then we consider the time stretch 
\tn \ +t from the time point t„ where we have information about the state. 

Observe that we know that the dual process at time T is a collection of Kt many independent 
birth and death processes, which have evolved for a time T and which in the large time limit 
T — >■ oo satisfy the stable age distribution. Therefore the state of the empirical measure of the 
birth and death processes at times T + 1 is given by the stable size distribution, which arises as 



(3.51) ju[oo,ds)vs 



where Vs is the law of the state of a single birth and death process at time s starting with one 
particle at the occupied site. 

We now fix some time t > i„ and suppress the t-dependence in the notation. We need the 
generating function 

(3.52) G„(z) = £;[z^(*-*-»)] 

of the state of the birth and death process at a randomly chosen site i. To calculate we introduce 
the generating function 

(3.53) G{z)^E[z% 0<z<l, 

where C represents the number of dual factors at a dual site under the stable size distribution and 
{C,{i),i € N) is an i.i.d. collection of such variables. We know that 

(3.54) Gn — > G as n oo, 

(3.55) Gjj — > G' on the imit circle as n — >■ oo. 
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Then noting that G'{1)=B = ^ and using (jXWl) : 



(3.56) 



= E 



n (1 - -f-,)'^'-'--' 



E 



n (1 



e{n)' 



E 



W„e(n)e°*(l+o(l))' 



as n — )• OO 



e{n) 

Ele-^"'^'"'''] 

- 1 - E[W]aiBe'^'' + 0(6^"*), as t -oo. 

From the first to the second hne we have approximated Gn by G and used at the same time the 
convergence to the stable age distribution. To go from the second to the third line we use that 
Wn — > as n — > OO. 

Therefore as t — > — oo: 

(3.57) m{t) - [E[W]aiB]e°'* + 0(6^"*). 
Case 2 

In case 2 the number of dual sites occupied in the dual process which have the value 02 at time 
tn, is denoted Z„(t,a2), and is Bin(W„e(n)e"*, distributed. Hence we get 

(3.58) [\l-x)C-idx)=E[ n (l-«2)«*-*-^)]. 

The r.h.s. of p.58p is approximated as above as in case 1 (see p.56p ) for n — )- 00 by 

(3.59) £;[(G(l-a2))^"^*'"'^" • 

Then Z„{t^ 02) converges as n — > 00 in distribution to a Poisson distribution with parameter We"*. 

For the calculation it is convenient to condition on W and read expectations as expectations 
for given W and write E for this conditional expectation suppressing the condition in the notation. 
We conclude that as n — 00 and then t — > — 00: 



(3.60) E[ {l-x)mdx)] 
Jo 



,[G(l-a2)-l]We° 



as ri — > 00 



-!-[(!- G(l - a2))]VFe"* + 0(e^"'), as t 
Hence as n — >■ 00 and then for i — >■ — 00 we get: 



-00. 



(3.61) E[ xCl\dx)]^[{l-G{l-a2))]E{W)]e''* +0{e'''*). 
Jo 

In other words also in this case m{t) = Const ■ e"* + 0(e^"*). 
Case 3 

We now turn to the general case. The key idea is to focus on the components where we do have 
type-2 mass even as t„ — > —00, which is achieved using the Palm measure. Consider the family 
{/^tnlneN of Palm measures on [0, 1]: 



(3.62) /it„ := 



xfif^^ (dx) 
lo ^ti-t„ (dx) 
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and denote the normalizing constant again as in case 1,2 by 
(3.63) e{n) = {[ xi^tAdx))'^. 



Since the family {11t^}neN is tight (by construction the measures are supported on [0, 1]) we can 
find a subsequence and this sequence we denote again by (t„)neN such that t„ — > — oo and ^t„ 
converges. This means we can find a measure /i on [0, 1] (with /i({0}) = 0) such that 



(3.64) 



x^it^ (dx) 



aSo + as n — >■ cxD. 



/g x^t„ (dx) 

Moreover there exists a scale s„, with s„ > and s„ — )- such that 
lo" xnt„ (dx) 



(3.65) 



/p xnt^{dx) 



We shall study the contributions due to the part aS^ and due to /I first separately in (i) and (ii) 
below and then join things to get the total effect in (iii). 

(i) We begin with the effect from aJo- For this purpose consider the measures (i-'„)rieN on [0, 1] 
defined by setting for every interval (c, d), with < c < d < 1: 



(3.66) z^„((c,d)) 



This allows us to consider the contribution due to the terms leading to aS^ in the expectation in 
the duality relation. Define 

(3.67) {l'n)nm 

as the empirical measure from a sample of size e^""*^*"*") sampled from Vn- Now we consider the 
state of the dual process at time t — tn and acting on the state at time t„ but considering only 
the contributions from sites with values in the interval s„, which means that we can represent the 
state at time by an empirical measure as follows. As n — > oo we have: 



(3.68) 



{l-x)mdx) E 



exp W„e° 



logG(l - VSn)l'n{dy) 



By (|3.126p G"(l) < oo and therefore the error term in (|3.68p is 0(sn 
as n — > oo we get using relation p.65p that: 



Then taking the limit 



E 



exp Wne"" 



logG(l - ysn)vn{dy) 



(3.69) 



> E [exp (-G'(l)Tye"*a)] as n oo 
1 - G'{l)E[W]e°'^a as t ^ -oo. 



Hence in this case we get if there are only contributions from a^o- i-e. = 0, that: 
(3.70) m(t) = (G'(l)S[iy]a)e"* + 0(6^"*), 
i.e. A = G'(l)£;[M^]a. 

(ii) Next we consider the contribution if they are only coming from /i. We have to "return" 
here from the Palm /i to a measure which need not be a probability measure since x~^ diverges 
a± x — Q. Note that for any e > the set of finite measures given by 



(3.71) {e(7i)(la;>e)Mt„(rfa;)}neN, 
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is tight in the space of finite measures on A4{[0, 1]) and we ean choose a convergent subsequence. 
We can then obtain a convergent sequence, we call again (t„)„gN such that: 

(3.72) e(n)(f2:>e)/ii,^((ia;) =4> /i, as oo 
where fi is a measure on (0, 1] with 

(3.73) / xjj.{dx) < oo, i.e. in particular /i([£, 1]) < oo for all e > 0. 
Jo 

We denote 

(3.74) /ig = restriction of fi to [e, 1]. 

By the dual representation we then have under the assumption 

(3.75) is discrete with atoms at xi,X2, ■ ■ ■ ,xm, 

that with Xg^(^ij being the value at site i at time tn the representation: 

NC(t-t„,i)l 



(3.76) 



f\i-x)c-{dx)^E[ n (1 



i=l,...,Kt 



Denote by Zn(t,Xj) the number of dual sites occupied in the dual process which have at time tn 
the value xj. The r.h.s. of (|3.76p we approximate again for n — >■ oo by 



Y[G{l^x,) 



Z„{t,Xj) 



(3.77) E 
We know 

(3.78) C[{Z^{t,Xj)),=i,..,M] = Mult (^W^„e(n)e"*, (^^ ),=!,. .,M 

The assumption p.75p is removed as follows. For a general /ig, we consider the random measure 

(3.79) Zn{t,dx) (the number of dual sites at which X2 G dx) 

which converges as n — >■ oo in distribution to an inhomogeneous Poisson random measure with the 
intensity measure 

(3.80) n{dx)We"K 

For the calculation it is convenient to condition on W and read expectations as expectations 
for given W and write E for this object. We conclude that as n — >■ oo and then t — )- — oo with the 
same reasoning as in the other cases: 



(3.81) E[l {l~x)mdx)] ^^fo\G{i-x)-i}t.idx)We-' ,asn^oo 

Jo 

^ 1-[J{1-G{1~ x))n{dx)]We°" + 0(6^"*) ,ast^ -oo. 
Hence taking first n — > oo we then get for t — > — oo the expansion: 

(3.82) E[f x^{dx)]^[f {l-G{l-x))fi{dx)]E[W]e"* +0{e^"'), 

Jo Jo 
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which then inipUcs that the mean curve satisfies if a = 0, that 

(3.83) m{t) = Ae"' + ©(e^"*), 
with 

(3.84) A^E[W]f {l-G{l-x))fi{dx). 

Jo 

(Hi) Finally we have to use the results in the two different cases where fl=0 respectively a = 0, 
to get the answer in full generality. Combining the expressions for the two contributions we obtain 
as n — > oo and then considering the expansion as t — > — oo that 

(3.85) [\l - x)Cl{dx) —> E re/(G(l--)-l)Md-)M/e-g-l4/G'(l)e-al ^ 0(g2at)^ 

Jo 

and as a consequence for t — !• — oo the r.h.s. is asymptotically given by 

(3.86) - [[j^ [G(l - x) - \]n{dx) + aG'{l)]E[W]^ e"* + ©(e^"*) = A • e"* + 0{e^°") 
with 

(3.87) A=( f [G{1 - x) - l]fi{dx) + aG'{l)) E[W]. 



Finally we have to exclude the possibility that the choice of different subsequences in p.40p . 
p.64p . p.72p (and hence of different values for A) would give a different mean curve. However this 
would lead to a contradiction since this would give two values to the same quantities. Therefore 
the resulting mean curve is independent of the choice of subsequence (in)neN of starting times. 

This means that we have proved that an entrance law with at most exponentially growing mean 
has the property that the mean curve has asymptotically as t -> — oo the form 

(3.88) m(t) = A-e"* + Cl(e2"*), 

with A given by the formula p.87p which completes the first of the two points we specified around 
(ICT) . 

We next need to argue that in fact such an entrance law exists by showing that actually £" 
(from (|3.42p ) converges along a subsequence. Due to the continuity of Ct in the initial state at 
time to, the tightness is straightforward. We have to show that first of all the limit is a solution 
and secondly the mean curve has the desired asymptotics. To show that we obtain in the limit a 
solution we argue as in the proof of part (c) of the proposition. It remains to verify the asymptotics 
of the mean curve. 

For this purpose we use the same line of argument as above for the first moment of X2{t) to 
show that also all higher moments of X2{t) converge as n — > cx) for every fixed t, so that we have 
convergence of the processes using the Feller property. For that we observe that we then simply 
have to start the dual process with fc-initial particles and then we carry out the same calculations, 
since the CMJ-theory works also for the CMJ-proccss started with a different internal state of the 
starting site. We omit the straightforward modification. 

Step 2 Uniqueness of solution with the t — > — oo asymptotics 

We next show that if the mean curves of two entrance laws differ at most by o(e"*) as t — > — oo, 
then they are identical. This then allows to conclude that if we prescribe the value A then there 
is exactly one entrance law with this asymptotics as t — >■ —oo. 
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For that purpose we consider two solutions {CDtewi, {C^)teM such that the mean curves (m^(i))tgR 
satisfy for £ ~ 1,2 that m^{t) ^ Acxp(at) a.s t ^ —oo. vbbbbbb We consider a sequence (t„)mGN 
with tn — > — oo as n — > oo and we fix a time point t G M. Then we define the two processes we 
apply the duality relation at time t 



(3.89) m'')s>t„ , (^s^ is>u 
which have at time t„ the initial distributions given by 

(3.90) resp. £?^. 

We apply the duality relation at time t. Our goal is to show that the mean curve satisfies 

(3.91) m^{t)^m'^{t), V t e K, 
by proving that for every t G K 

m"'i(t) = m"'2(t) _,. o(i) as n oo. 



(3.92) 
W 

(tn) — m 
(3.93) 



We first verify this in Case 1. Let be two solutions as in case 1 but such that 

Then by 



< E 



= E 



Kt- 



ein) 



e[n) 



Kt 



Kt- 



n ^"(1 



Ql 
ein) 



Kt- 



< i;[G"(l)W„e(n)e"*o(— ^)] ^ as n 
e(n) 



ai 

e(n) 



e{n) 



oo, 



where we have first used that we can split the expectation in the duality relation into one over the 
initial state and then over the dual dynamic, furthermore for z £ [0, 1], G'Jj(z) < G'Jj(l) < G"(l) 
(the latter by coupling). Hence we have = in this case. 

In case 2 there iis o ( j change in the mean curve resulting from the change of 02 to 02 + 



1 

e{n] 



Moreover given a change of the frequency 



^ to ^ 



1 

e(n) 



we have Zn{t,a2) 



1 



and the limiting Poisson distribution is not changed by the 



is Bin(M/„e(n)e"*,^4 
o perturbation. 

Again, the general case follows by a combination of the arguments of cases 1 and 2 exactly as 
in case 3 in Step 1. 

Step 3 Uniqueness given the mean curve 

To complete the proof for uniqueness of the entrance law with specified behaviour as t — > —00, 
it remains to show that for a given mean curve {m{t))t^^ there is a unique solution (£j)igR to the 
McKean-Vlasov equation having this mean curve. Since the McKean-Vlasov dynamics is unique 
for t>to it suffices to prove uniqueness of L^^ for arbitrary ig- 

Now consider the tagged component in the McKean-Vlasov system (x^(l, t))t>t„ assuming that 
the mean curve (TO(i))tgR satisfies 

(3.94) limsupm(t) = 0. 



3 PROOF OF RESULTS 



40 



Then Cf is the law of a tagged component conditioned on the reahzation of the mean curve and 
this conditioned law is a weak solution of the wellknown SDE 

(3.95) dx{t) = c(m(t) - x{t))dt + sx{t){l - x{t))dt + \/x{t){l - x{t)dw{t), 

(cf. (|1.2ip ) which has a unique weak solution given an initial value at time ig- However we must 
prove the existence and uniqueness of a solution with time running in K. and satisfying 

(3.96) lim x^{l,t) = 0. 

Wc construct a "minimal solution" x^{l,t) for this problem by starting with a sequence of 
solutions X(^n){t) of (|3.95p (driven by independent Brownian motions w„(i)) starting with value 
x^itn) = at times (tn)neN with t„ — ^ —oo. Wc then construct a coupled version {x™ : m > mo} 
by coupling paths x"* and x", n < m at a time where they collide. It can be verified that the 
coupled system {(2?"(t))tgR_„gN} is a stochastically monotone increasing sequence as ti — )■ oo with 
limit x'^{l,t). If we assume existence of a solution of p.95p with time index R and mean curve 
m{t) we then know it will be stochastically bounded below by x^. 

Next we need uniqueness in law of the solution and we shall show that it must agree with 5;^ , 
which then automatically must also be a solution. We observe that any solution to p.l06p must 
have zeros at — |t| for arbitrarily large |t| (cf. classical property of Wright-Fisher diffusions |S1|V 
If it has a zero at t* > tn, then we can couple it with Xn and therefore we can couple the diffusion 
with the minimal one therefore the considered diffusion must agree with a version of x"^ after the 
coupling time. Combined with the uniqueness property of the McKean-Vlasov equation this also 
proves uniqueness of £j , t > tQ. Since to is arbitrary, we obtain the uniqueness for all i G M. This 
completes the proof of (e) of the Proposition 11.31 

(f ) Finally, we consider a random solution {Ct : < G M} to the McKean-Vlasov equation. Since 
Ct is a.s. a solution and since by assumption it satisfies a.s. the growth condition with A G (0, oo), 
it follows that it is given a.s. by a (random) time shift of the standard solution. 

3.2.3 The structure of the dual process and a Crump-Mode-Jagers process 

The key tool for the proof of the emergence results is the dual representation for the system of 
N exchangeably interacting sites, namely, the dual process (ritjJ^j^) (note the dependence on N is 
suppressed in the notation). 

Since in the case of two types, K = {1, 2} the two frequencies of type one and two add up to 1, 
it suflices to determine the law of xiC-^^ + 1) and SAr(i^) + 1, {1}) in the limit iV ^ oo. This 
will, as we shall see in Subsubsection [3.2.111 involve computing (£, fc)-moments, i.e. the product of 
fc-th moments at ^-different sites for the TV-site system and for the random McKean-Vlasov limit. 
The calculation is carried out using the dual process started with k particles at each of i different 
sites. We denote the number of particles in the dual process at time t with such an initial condition 

by 

(3.97) nf'*^' , we often write Ilf if {k,i) are fixed. 

First, to prove that the mass of the inferior type at a tagged site goes to zero in time scale 
\ogN -\-t as iV oo and then t — )■ oo, it suffices to work with i?[a:;i(-)]. The calculation of this 
expectation involves only one initial particle for the dual process. To carry out the calculation we 
assume that a:;i(0, i) = 1 ior i — 1, - ■ ■ ,N and take the initial element for the dual process 

(3.98) vo = S^„, 

i.e. one particle at a tagged site in io G {1, • ■ • i N}. Note that because of migration the law of this 
process depends indeed heavily on iV if t is large enough. 
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The number of factors at each site form a system of dependent birth and death processes with 
immigration, i.e. a process with state space (Nq)^, denoted 

(3.99) {C{t))t>o with C^(t) = {C^(t, z), z = 1, . . . , N}. 
In other words 

N 

(3.100) nf'" = 5]c^(t,i), 

and C^{0) has exactly ^-non zero components each containing exactly k factors. 

Next we describe the dynamic of C,^ . Consider first a single component i. If the current state 

of C,^ at a site i is a; G Nq, then 

the immigration rate of one new individual to this site is c{N^^(n^''''^ ^2;)), 

(3.101) where every individual at the other sites has the same chance to be selected for 
migration to i, 

N - I 

(3.102) the emigration rate for one individual at this site is c — — — x, 
the individual moves to a randomly chosen different site, 



(3.103) the death rate, i.e. rate for the death of one individual at this site is 

{d/2)x{x- 1), and 

(3.104) the birth rate for one new individual is sx. 

The dependence of the birth and death processes {({t, i))t>o, i = 1, • • • , iV at different sites arises 
from the migration transition in (j3.10ip . namely the immigration is coupled with the emigration 
at another site. 

Note that because of the exchangeability of and the initial state (0) it suffices for the 
dual to keep track of 

• the number of occupied sites, 

• the number of individuals, i.e. factors, at each occupied site. 

We now introduce the notation needed to describe the dual population. First the process 

(3.105) {K^}t>o 

that counts the number of occupied sites at time t. Secondly the process recording the age and size 
of each site (age is counted from the time of "birth" , that is, the time when the site first became 
occupied the last time). Note that each site is uniquely identified by its birth time. Equivalently, 
due to exchangeability it sufiices to keep track of the number of occupied sites and the empirical 
age and size distribution at time t, a measure on K+ x N denoted 

(3.106) *^(t,ds,dy), 

where {t, (a, b], y) denotes the number of sites in which the birth time, that is, the time at which 
the initial immigrant arrives, lies in the time interval (a, b] and the current size is y G N. 
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Wc will use the abbreviation 

(3.107) *^(t, ds) = *^(t, ds, N). 

Consider next for a fixed t e M+ the eoUeetion of processes 

(3.108) {{C{u))u>s,se suppvI/^^(i,.,N)}, 

which denotes the size of the population at a site having birthtime s, that is, one particle arrives 
at time s. In the absence of collisions these processes for different birth times are independent 
birth and death processes and their distributions depend only on their current age. However in 
the presence of collisions these are coupled (by migration) and no longer independent. 

The number of occupied sites fluctuates due to migration to new sites respectively jumps from 
sites with one particle to other occupied sites. Note that the probability of a jump of the latter type 
is non-negligible only if the number of occupied sites is comparable to the total number of sites. 
Hence up to the time that the number of occupied sites reaches 0{N) the number of occupied 
sites is (asymptotically) non-decreasing and the birth and death processes at different sites are 
asymptotically independent. 

We obtain an upper bound for the growth of {K^)t>o if we suppress all collisions and assume 
that always a new site is occupied. In order to construct this process, we enlarge the geographic 
space from{l, • • • , TV} to N and drop in (|3.10ip all jumps to occupied sites replacing them by jumps 
to a new site, namely the free one with the smallest label. We refer to this process as the McKean- 
Vlasov dual since it arises by taking in our model of N exchangeable sites the limit as — >■ oo of 
the dual process. This process is denoted (if we want to stress how it arises we add the superscript 
oo): 

(3.109) (Ct) = (Ct(l),Ct(2),---) 

and the process of the total number of individual by 

(3.110) {u[''^'^)t>o, 

and ^ has the following markovian dynamic. 

If we have at a site i G N occupied with a: e N individuals then we have 

(3.111) one death at rate -^x{x — 1), 

(3.112) one birth at rate sx 

and if J is the smallest index with Ct{j) = then 

(3.113) at rate c(^{i) an individual moves from i to j. 

Since in the duality relation we only need the occupation numbers of occupied sites, we can 
ignore migration steps of single factors to another unoccupied site. Therefore in the migration rate 
in (j3.113|) we take 

(3.114) cCt(z)l(^>2) instead of cCt(i)l(a;>i)- 

This way we obtain, completely analogous to p.lOSp - (|3.108p new processes which we denote 

(3.115) iKt)t>o, («-(<, ds, dj))t>o, {{Csiu))u>s, s e M+}. 
Then we know that for every t £ M"*" the collection 

(3.116) {{Cs{u))u>s,s e supp«'(i,-,N)} 
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consists of independent birth and death processes. 
We can show that 

(3.117) < Kt , stochastically for all t > and N eN. 

Namely on the r.h.s. we suppress collisions and at each jump occupies a new site. Since after a 
collision we can have coalescence with one of the particles already at that site, a coupling argument 
shows that we have fewer particles as in the model with collisions, in fact strictly fewer with positive 
probability. 

Then the dual (774, J^^^) on the A^-site model evolves exhibiting the following features. The dual 
population consists of an increasing number, 11^, of (l{i}) factors (due to birth events related 
to selection). Each factor (l{i}) (and therefore the product) can jump to by rare mutation at 
rate ^ (because the factor jumps to (1{2}), which becomes in the dual expression for E[x^ {t)] 
by (|2.10l with n = 1) since (0) = 0). Therefore asymptotically as — >■ 00 to have non-zero 
probability of such a jump to zero to occur, we need 11^ ^ 0{N). This means that asymptotically 
the probability that a mutation event occurs becomes positive only for sufficiently large times such 
that the number of (l{i}) factors in J^^ reaches 0{N). 

In order to see the meaning of this behaviour for the original process recall the first moment 
duality relation (assuming (0) ~ 1) rewritten in terms of the quantities introduced p.lOS^ - 
(IXTMll : 

E[x^it) 

(3.118) 

In order to prove Proposition ll.7[(|1.66p . we therefore need to show that asymptotically the 
probability that a mutation event occurs by time Tn + 1 goes to as iV — >■ cx) and t ^ —00. 

A complication in the analysis of the dual process arises in that if a migrant lands at a previously 
occupied site, that is, a collision occurs, then the different birth and death processes are not 
independent due to coalescence with its quadratic rate and we must take this into account when 
reaches 0{N) since there is then a non- negligible probability of a collision. 

We will therefore study in Subsubsections l5^.2.4l and r3.2.5l first the collision-free regime of the dual 
dynamic and in Subsubsection [X^^ its consequences for the process and then in Subsubsections 
I3.2.6ll3?2.10l the regime with collisions. 

3.2.4 The dual in the collision-free regime: the exponential growth rate 

Due to coalescence (if d > 0) the total number of (l{i}) factors is locally stochastically bounded 
but due to migration the number of occupied sites can grow and with it the total number of factors. 
We need to find a such that the number of factors is at time logiV -I- t of order iV for t > to 
and large and the integrated lifetime of all factors up to that time is 0{N) as iV — > 00, with 
smaller and smaller number of factors as i — > —00 and a diverging number as t ^ 00. We therefore 
will fix a candidate a and then show lower and upper bound on the number of factors. We will 
introduce this a as the exponential growth rates of the number factors as A'^ — >■ 00 as follows. 

If c > and N = 00 (i.e. "mean- field migration"), then the number of factors (i.e. H^) would 
grow exponentially fast as we prove below. We will now first work with this scenario of mean-field 
migration in Step 1 providing a candidate for a and an upper bound. In the next subsubsection we 
return to the effect of collisions, which can arise if A^ < 00 and we have migration to a previously 
occupied site to get a lower bound. 



C{u)^'^{u,dv) ] du 



E 



exp 



m 

N Jo 
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In order to verify (|1.66p . first note that by Jensen and p.llSp 

C{u)-f^{u,dv)]du 



(3.119) E[x^it)] > exp i^^^^E 

In the next three steps we analyse first (u) given for r from up to u to get an estimate for 
the growth rate of the integral in the r.h.s. in terms of the process of (|3.119p and then in the 
second step we focus on the growth of in u and in third step will then be simple, we combine 
the results to get that the a we have chosen is an upper bound for the exponential growth rate (in 
t) of the double-integral in the r.h.s. of p.llQp . 

Step 1 Sufficient condition for lll.66\) 

We next show in order to control the r.h.s. of p. 1191) that 

(3.120) Mm sup E[C{u)\a{K^ : s < u}] < L 

and L is non-random, i.e. a constant. Given a and recalling p.ll9p in order to establish the upper 
bound on the emergence time, it sufltices once we have p.l20p to show that: 



log N ^ 

(3.121) lim lim 4 / ° E\K^]du = 0, 



which we will do in the Step 2. 

To prove (|3.120p consider the collection of auxiliary processes (here l E K"*") 

(3.122) (Co,t(0)t>o, 

which as it turns out bounds our process from above in a site colonized at time at time t for 
proper choice of the parameter l. It has the dynamic of a birth and death process with linear birth 
rate sk, quadratic death rate |fc(fc — 1), emigration at rate cfcl(i,>2) and immigration of a particle 
at rate l. The parameter l we choose later. The case l — corresponds to the McKlean-Vlasov 
dual, i.e. the dual in the collision-free regime. The process has a unique equilibrium state 

(3.123) Co,oo(0, 

and is ergodic (with exponential convergence to equilibrium). 

We have (using a standard coupling argument) in stochastic order: 

(3.124) Co,oo(0) < Co,oo(0- 

A standard coupling argument can then be used to show that for all t > and every s the random 
variable (recall we start in one particle) 

(3.125) Cs.ti^) is stochastically dominated by Co.oo('-)- 

Moreover writing down the differential equation for the second moment of (^o,t it is easy to verify 
that for every i> 0: 

(3.126) £;[(Co,oo(0)'] <oo. 

With our McKean-Vlasov dual process we can associate a mean-field birth and death process 
where the parameter l is time-dependent, namely the immigration it is the current mean of the 
process. This process arises as the limit — > oo of the process with migration between N sites 
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according to the uniform distribution and starting with a positive intensity of particles. The mean- 
field birth and death process has a unique equilibrium which we reach if we start with one particle. 
Namely it can be verified that the fixed point equation 

(3.127) L,^E[Co,ooM] 

has a finite solution which gives the expectation for the equilibrium of the McKean-Vlasov dual 
and that in fact determines the McKean-Vlasov equilibrium. 

To show this we consider the nonlinear Kolmogorov equations for the process (Co,t(''))t>o which 
are spelled out in all detail later on in (|3.268p where we set in p.268p a{t)u{t) = l. More precisely 
let Pk.e{t) — Pk.e{i-,t) denote the transition probabilities and set 

oo 

(3.128) mt(t) =^jpi,,(i,i), 

OO 

(3.129) m{L) = \im^jpij{L,t). 

To look for equilibria we look for fixed points m{i) = l. Using coupling we can show that m{i.) is 
monotone increasing, continuous, ultimately sublinear in i and limt_j.o m(t) > since the smallest 
state is 1 if we start in 1. Therefore there exists a unique 

(3.130) largest solution t* of the equation m(t) = l with mt < m{i*). 

If we insert this t* as parameter the mean remains constant and the equilibrium of the McKean- 
Vlasov dual process, is the unique equilibrium of the Markov process (Co,t(''*))t>o- 
Wc see again by coupling that for l < t*: 

(3.131) Co,oo(0) < Co,oo(0 < CoM'-l- 

We now show that this provides the bound required in p.l20p . Note that at a tagged site in 
the exchangeable system of N sites the number of particles is given by 

TT^ IT 

(3.132) Co"!* = Co,iiOu<t) with ^ < ^, 

where lit is the number of particles in the McKean-Vlasov dual process (without collisions) , which 
can be constructed on the same probability space as 11^, such that the inequality on the r.h.s. 
holds and such that 

(3.133) Co'!t<Co,t((^)«<t)<Co,oo(^*). 

A standard mean-field limit argument then shows that {(,^^,11^) converges in law as iV — > oo to 
iCo,t{{'mu}u<t)', Kt), where in the latter the two components are independent. 
Therefore using ()3.131|) and the construction of l* wc see that: 

(3.134) hm S[C|nf : s < <] = E[Co,ti{m^.}u<t)] < E[Co.M] = i* 

and this proves p.l20[) . 

Step 2 A Crump- Mode- J agers process gives a Malthusian parameter a. 

To show (|3.12ip and to obtain a candidate for a we can use the collision-free mean-field dual 
Ht in particular its ingredients Kt and Q. 
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The process K in (|3.115p has some important structural properties. Recall the process counts 
the number of occupied sites, each of which has an internal state, which consists of the number of 
particles at these sites. Each of the particles exceeding the first particle can migrate and produce 
a new occupied site. Whenever a new site is colonized then an independent copy of the basic one 
site process starts and evolves independently of the rest. 

To analyse this object we recall the concept and some basic results on supercritical Crump- 
Mode-Jagers branching processes and in particular its Malthusian parameter a. Such a process is 
defined by the following properties. 

The process counts the number of individuals in a branching population whose dynamics is as 
follows. Individuals can die or give birth to new individuals based on the following ingredients: 

• individuals have a lifetime (possibly infinite), 

• for each individual an independent realization of a point process ^(t) starting at the birth 
time specifying the times at which the individual gives birth to new individuals, 

• different individuals act independently, 

• the process of birth times is not concentrated on a lattice. 

Let {Kt)t>o be a process with the described structure. The corresponding Malthusian parame- 
ter, a > is obtained as the unique solution of 

/•oo 

(3.135) / e'"* fi{dt) ^1 where n{[0,t]) = E[^{[0,t])], 
Jo 

with 

(3.136) ^{t) counting the mmrber of births of a single individual up to time t, 

(see for example |J92| . [N] equation (1.4)). 
In our case we have 

(3.137) ^l{[0,t])^c^ E[Co{s)l^Co(s)>2)]ds. 

Jo 

Remark 17 Note that in our model particles at singly occupied sites do not migrate. We can 
suppress the emigration step if only one particle is left, since in that case everything would simply 
start afresh at some other site. We have done this so that in the pre-collision regime the number 
of occupied sites equals the number of migrating particles. In this case the rate of creation of new 
sites due to migrations from a given site is given by Co('S)lco(s)>2 o.i^d the death rate when C,q{s) = 1 
IS 0. 

However we can also consider the original model in which particles at singly occupied sites do 
migrate. Then sitej have finite lifetimes and the rate of production of new sites by a fixed site at 
time s is given by Co(s) where now Co(s) has death rate c when Co{s) = 1- Then we have instead of 
^3.137^ the equation 

(3.138) ti{[0,t])^c f E[(o{s)]ds. 

Jo 

However the Malthusian parameter describing the exponential growth of the number of sites occupied 
by at least one particle is the same in both cases as is shown by explicit calculation. 
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We therefore define a by (recall for (|3-115p and above): 

oo 

(3.139) c J e— 'i?[Co(s)l(Co(s)>2)]ds = 1- 



The first thing we want to know is that in our case 

(3.140) ae{0,s) if d > and a e (0, s] if > 0. 

In the case of our dual process Kt < Nt where Nt is a rate s pure birth process and therefore 
a < s. 

To show that the Malthusian parameter satisfies: 

(3.141) a > 

note first that when d > still no deaths (in the number of occupied sites) occur and that a lower 
bound to the growth process is obtained by considering only those migrants which come from 
the birth of a particle at sites containing sofar only one particle and which then migrates before 
coalescence or the next birth occurs. This process of the number of such special sites is a classical 
birth and death process with in state k birth rate s( ^_^ ^_^^ )fc and death rate zero which clearly has 
a positive Malthusian parameter (if c > 0) and hence (|3.141[) holds. 

Next to see that if d > then a < s, note that a non-zero proportion of the new particles 
generated by a pure birth process at rate s dies due to coalescence and therefore a < s in this case. 

If we know the Malthusian parameter, we need to know that it is actually equal to the almost 
sure growth rate of the population. It is known for a CMJ-process {Kt)t>o that (Proposition 1.1 
and Theorem 5.4 in [N]) the following basic growth theorem holds. If 

(3.142) S[Xlog(X V 1)] < oo, 

oo 

where here X = J e~"*(i^(i), then 



(3.143) lim — \ = W, a.s. and in Li, 

t^oo s 

where is a random variable which has two important properties, namely 

(3.144) > 0, a.s. and E[W] < oo. 

We now have to check that (|3.142p holds in our case. 

oo 

We know using p.l39p that / ue^""/i((iu) < oo, since we can bound the birth rate by the one 



in which the number of particles at each site is given by the equilibrium state C{C,ao), which has 
a finite mean. This also implies that assumption p.l42p holds. Namely, in equilibrium we can 
estimate 

oo oo 

(3.145) E[X^] < J J e^"(^+*)£;[de(s)dC(i)] < oo, 





by using Cauchy-Schwartz and using p.l26p . 
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Remark 18 In the above we considered the dual J-^ starting with one factor (particle) at one 
initial site. We can also consider the case with k initial particles at one site. In this case the same 
exponential growth occurs since \3.131^ still holds but the corresponding random growth factor VF^*^) 
in {3.143^ now depends on k. Similarly we can start with k particles at each of £ different sites and 
again get exponential growth with the same Malthusian parameter a but now with random variable 
\Y{k,e) _ y/jjg ^j^ii frequently used in the sequel of this section. 

Step 3 Completion of argument 

To complete the argument we note that p.l2ip foUows from (|3.143p (which imphes that Kt = 
e-^^Wt with Wt W) and p.ll7|) since 

(3.146) E{ / Kfds) < Const ■ -E{W)e^°s^~''' . 

Jo a 

We saw earher that p.l21|) is sufficient for (|1.66|) and the proof is complete that emergence does 
not happen before time a^^ logN a.s N ^ 00. 



3.2.5 The dual in the collision-free regime: Further properties 

We shall need for the analysis of the regime with collisions some further properties concerning the 
dual in the collision-free regime. First, we obtain further properties of the growth constant W, 
more precisely its moments, and then the stable age distribution which among other things allows 
to give an explicit expression for the growth rate a. The random variable W is defined in p.l43p 
in terms of the process {Kt)t>o introduced in (|3.115|) . that is, we consider the dual process without 
collision. 

Step 1 Higher moments of W . 

In the sequel we will perform moment calculations which use: 



Lemma 3.4 (Higher Moments ofW) 
For all n, 

(3.147) EiW] < 00. □ 

Proof of Lemma 13.41 By [BD, . Theorem 1, W has a finite ?ith moment if and only if 



(3.148) E 



/ e-^^d^s) 
Jo 



< 00, 



where ^ (t) denotes the number of particles that have emigrated by time t from a fixed site starting 
with 1 particle. We will verify this condition for the second moment. 

-, 2 



(3.149) 



E\ 



e-"'d^is) 



= 2E 



00 pOO 



-as -a(s+s')) 



das')das) 



c'E 



c^E 



00 poo 



Cois)E[Cois')\Cois)]ds'ds 



Js 

00 poo 



Js 



e-"%o{-s)Co{s')ds'ds 



Recall that here Co is a birth and death process with birth rate sk and death rate cklk>2+^d-k{k—l). 
But then by explicit calculation sup £'(Co(s)^) < 00, and therefore 



(3.150) 2E 



00 pOO 



e-"Xois)Coi-s')ds'ds 



< 00. 
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This completes the proof that E{W^) < oo. 

Recalhng that Co(^) is for all t stochastically dominated by the equilibrium distribution {pk)keN 
of this birth and death process and that the latter satisfies for some C < oo, 

(3.151) Pk<^, for all fc= 1,2,--- 

fc! 

it follows that for every n G N, 

(3.152) sup£;[(Co(s))"] < oo. 

Then a similar argument to the second moment case argument given above verifies that nth mo- 
ments of W are finite. 

Remark 19 Alternatively to the above we can argue as follows. In the pre-collision regime we can 
also represent the particle process as a branching Markov chain, more specifically a branching birth 
and death process (linear birth rate and quadratic death rate), that is a counting measure-valued 
process, on N, 

(3.153) Kt e M{n), 

where At{k) counts the number of sites occupied by exactly k partition elements. In other words the 
individuals of this new process A correspond to the occupied sites of the dual particle system and 
the location of individuals is now the size of the population at the sites of the dual particle .system. 
The dynamics of {At)t>o therefore given by 

• The branching. Individuals branch independently with a location- dependent branching rate, 
namely, an individual at k dies at rate ck and produces two new individuals with new locations, 
namely one individual at k — 1 and a second one at 1 . Since sites of .size that would be 
produced if a site of size 1 dies would not contribute to the production of new sites, they can 
be ignored. For this reason we suppress these deaths and set the death rate at k to be cklk^i. 

• The motion: the motion between locations is given by a Markov process (C(0)t>o on NU{oo}, 
namely, a birth and death process {(^{t))t>o, where the birth rate is sk and the death rate 
dfc(fc-l)/2. 

In other words the CMJ process can be embedded in the branching birth and death process and 
the number of individuals in that process is the same as the number of occupied sites in our original 
CMJ-process. 

We work with the probability generating function, Mf , of branching birth and death process Aj 
and use this to compute the distribution of the number of occupied sites as follows. 
Define the generating function: 

oo 

(3.154) Aft(fc,z) = £;5jz^'(iw)] =^p(i,fc,j>^ 0<z<l, 

where we write Kt{f) ~ YlTLi f{i)^t{k). 

Then Mt satisfies the standard functional equation 

Mt{k,z)^ i;fe[e-'=-/'o(f("»''^]z 
^^■^^^^ +Ek [/o e--/o au)du . . M,_,(C(s) - 1, z)Aft-.(l, z)ds 

with Mt{0, z) = 1. Here Q is a birth and death process with birth rate sk and death rate ^dk{k— 1). 
The above equation can be solved recursively for the coefficients of z^ . 
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This relation can also be used to obtain moment formulas by evaluating the appropriate deriva- 
tives at z = \ . 

Consider the semigroup {Tt)t>o of the migration defined by 

(3.156) TJik)^Ek[e--foCi^)d^f{C(t))], keN 
and let 

(3.157) /,(j) :=/, ^=1,2,3,.... 
Let for £ = 1, 2, 3, • • • and k G N: 

(3.158) m,(t,fc)-Sfe((At(l{N}))'). 
Then the following moment relations hold: 

(3.159) m^{t,k) = TtfQ{k) + c f Ts{fi{-)[mi{t - s,l) + mi{t ~ s, ■ - l))]}{k)ds. 

Jo 

(3.160) m2{t,k)= Ttfo{k)+c [ {Ts{fi{-)-[{m2{t-s,--l) + m2{t-s,l)) 

Jo 

+ {mi{t - s,l) ■ mi{t - s,- - l))]}{k)ds. 
Also note that mi(t, k + j) < mi{t, k) + mi(t,j) so that mi(t, k) < kmi(t, 1). Therefore we get 

from nim 

(3.161) m{t,l) <Ttl + c f {T,k){l){mi{t - s,l)ds + c f T,{k{k - l)mi{t - s,l)){l)ds. 

JQ JO 

LetM(X), F{X), G(A) and denote the Laplace transforms of mi, T^l, Tsk+Ts{k{k—l)) as functions 
of 3, respectively. 
Then 

(3.162) M{X){l^cG{\)) < F{X). 

We need X > Xq so that this is invertible. This gives an upper bound for the Malthusian parameter 
a. 

Step 2 Stable age and size distribution 

Return to the McKean-Vlasov version of the dual process (which is the dual in the collision-free 
regime in the limit N oo). If we consider for every time t for each individual (i.e. occupied site 
in our case) currently alive its age and size, then we can introduce a random probability measure, 
the normalized empirical age and size distribution of the current population, which we denote by 

(3.163) U{t^du,j) = ^^^^p^. 

Fit 

The marginal random measure U{t, du, N) converges in law (we use the weak topology on measures) 
as t — > oo to a stable age distribution 

(3.164) W(oo,dw,N) on [0,oo), 

according to Corollary 6.4 in [N], if condition 6.1 therein holds. The condition 6.1 in jN] or (3.1) 
in |JN| requires that (with fi as in p.l35p ): 

/>oo 

(3.165) / e-^*fi{dt) < oo for some /3 > 0. 
Jo 
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The condition 7.1 in [N] holds for any /3 > since the local (at one site) population of the dual 
McKean-Vlasov particle system {rjt)t>o given by (Co,t)t>o goes to a finite mean equilibrium. 

Since the distribution of size at a site depends only on the age of the site, it follows that 
U(t,du, •) converges to a stable age and size distribution, i.e. 

(3.166) U{t, •, •) =^ U{oo, •, •), as < — > oo in law. 

We can now also calculate a and the asymptotic density of the total number of individuals B 
as follows unsing a law of large number effect for the collection of independent birth and death 
processes at the occupied sites. Namely the frequencies of specific internal states stabilize in the 
stable size distribution, while the actual numbers diverge with the order of Ktj^ as A'^ — > oo. Given 
site i let > denote the time at which a migrant (or initial particle) first occupies it. Noting 
that we can verify Condition 5.1 in [N] we have that 

1 f 

(3.167) lini — y^ (nit — Ti) — / E[(oiu)]U{(X),du) — B {a constant), a.s., 

by Corollary 5.5 of [^. The constant B in (j3.167p is in our case given by the average number 
of particles per occupied site and the growth rate a arises from this quantity neglecting single 
occupation. Namely define 

oo 

(3.168) a = c^jZ^(oo, [0,oo), j) < oo, 7 c Z^(oo, [0, oo), 1). 
Then 

(3.169) B^^^. 

c 

Furthermore the average birth rate of new sites (by arrival of a migrant at an unoccupied site) 
at time t (in the process in the McKean-Vlasov dual) is equal to 

(3.170) a = cS-7. 



3.2.6 Dual process in the collision regime: macroscopic emergence 
Proposition 11.71 (a) 

We have obtained in Subsubsection [5.2.41 a lower bound on the emergence time by an upper bound 
on the number of dual particles. Now we need an upper bound on the emergence time via a lower 
bound on the number of dual particles. 

In order to prove (|1.67p return to the dual particle system specified below p.97p . Then after 
the lower bound for the emergence time in Subsubsection I3.2.4| where we ignored collisions in the 
dual particle system, we derive here a lower bound on the growth of the number of occupied sites 
in this system starting with one occupied site incorporating the effect of collisions. We proceed 
in five steps, first we consider lower bounds on the number of particles in the dual process, then 
refine this in a second step by constructing a multicolour particle system which is an enrichment of 
the dual particle system, then in a third step we prepare the estimation of the difference between 
collision-free and dual system, and in the fourth step we turn this into an upper bound for the 
emergence time. In Step 5 we then show the convergence of the hitting times for reaching [eA'^J 
dual particles. 

Step 1: Number of sites occupied by the dual process: preparation 
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Here we must take into account the effect of collisions and show that (|3.139p indeed gives the 
correct a which describes the growth of the number of factors in the dual process. 

If we have collisions, that is, migration to occupied sites, we have two effects. (1) We have 
interaction between sites. (2) If a particle migrates from a site containing only one particle to an 
occupied site, this results in a decrease by one in the number of occupied sites. For these two reasons 
the process counting the number of occupied sites, denoted by , is no longer approximated by 
a nondecreasing Crump-Mode- Jagers process (which is based on independence of internal states) 
when it reaches size 0{N) and at this moment it is also no longer non-decreasing since now singly 
occupied sites can dissappear by a jump to an occupied site. 

To handle these two problems the key idea is to carry out the analysis of the dynamics separately 
in three time intervals defined as follows. Set first 

(3.171) riog N = inf {i : Kt = [log N\}, (e) = M{t : Iff = leN\ }. 
Then define the three time intervals as: 

[0, TiogAr), 

(3.172) [riog N,riosN + - (log N - log log Nj), 

a 

hog N + - (log TV - log log TV) , (e)] . 
a 

What happens in the first interval? Note (since riog at << t^{s) for N — >■ oo) that asymp- 
totically as — > oo, in a single migration step the probability of a collision is 0{N~^ log A^). In 
particular over a time horizon of length o{N/ log N) the probability to observe a collision ever goes 
to zero as iV — >■ cx). Hence looking at the formulas we obtained for a and with the properties of 
exponentially growing populations (occupation measure in current size) we can replace our dual 
with the mean- field dual without collisions so that with p.l43p we can conclude as oo: 

/0 17QN log log TV \ogW 

(3.173) riog AT = ho(l) 

a a 

and we know furthermore that this process has (asymptotically) reached the stable age distribution 
at time riog n according to the analysis of Subsubsection 13.2.51 Step 2 therein. 

Now consider the second interval. From the previous upper bound result still no collisions 
occur comparable to the dual population in this interval (the collision probability is now at most 
0((log A^)"^) and hence at most only finite many collisions may occur over the whole time interval, 
essentially at the end and we can assume that asymptotically as A^ — > oo we can replace our dual 
process again by the collision free mean-field dual at the left end point but now in the whole time 
span the stable age distribution is in effect. Hence we can in the second interval continue working 
with the mean-field dual without collisions and we can even assume the stable age distribution in 
effect. 

Returning to our dual process we conclude (recall p. 1721) ) that at the end of the second interval 
we have as A^ — ^ oo: 

A^ 



TlogN + i(log W-loglogAf) logA^' 

'' log N ' 



Remark 20 Note that this means that — tJ^„m, with s^ >> log log A'' but sn << log A', is 



asymptotically as N ^ oo deterministic. 

We now consider the third interval, i.e. the regime in which by p.l74[) above 

(3.175) > W- 



logN 

In this interval two new effects must be considered, namely, 
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(A) a decrease in the number of occupied sites when a lone particle at an occupied site jumps to 
another currently occupied site 

(B) a particle moves from a site occupied by more than one particle and jumps to another occupied 
site. 

We note that effect (A) also increases the age distribution of occupied sites since in a site the 
process is stochastically increasing and the young sites can disappear therefore easier (and hence 
the age of the total population becomes stochastically larger). Effect (B) tends to increase the size 
of occupied sites of a given age compared to the case without collision. Therefore the result of 
(A) and (B) is to tend to increase the average size of occupied sites compared to a system without 
collision (recall the older sites are stochastically larger), but decreases the number of occupied 
sites. 

We note that larger sites increase the number of migration steps. Altogether this means that 
we expect that the rate of growth of denoted /3jv(0 (i.e. /3Ar(i) = {K^^^^ — K^)/K^) satisfies 
in the third time interval (for N > log log A^) if the present state of is k: 

(3.176) /3^(.) > «(1 - A). 

In order to control these effects precisely we use the technique of coupling constructed from an 
enriched, i.e. multicolour particle system, which we introduce next. 

Step 2: A multicolour particle system 

To examine rigorously the growth in the time span, in which the number of occupied sites 
reaches 0{N) we construct a coupled system of multicoloured particles such that we can compare 
the new system with collisions effectively with the simpler one without collisions, and are able to 
carry out estimates on the difference. This way we can obtain the lower bound on the number of 
occupied sites and the total number of particles. 

First an informal description. We shall have white, black and red particles. We start with white 
particles only. In intervals 1 and 2 the process of the white particles grows as before, i.e. without 
collisions. However as soon as collisions occur we consider a modified system. The colouring of the 
system allows us to keep track of events like (A) and (B) given above. For example in the case of 
(A) we will mark the lost site by placing there a black particle and the particle that jumped and 
collided with other particles on the new location is marked by giving it a red colour. We handle 
the second effect (B) by using bounds from below on the rate of founding of new occupied sites. 

Formally proceed as follows. The multicolour comparison system has black, white and red 
particles. It has white and red particles located at the sites {1,2, •••,A^} and black particles 
located at a site in N where {1, 2, • • • , N} and N are disjoint finite and countable sets respectively. 
In other words the geographic space of this new system is 

(3.177) {1,2,---,A^} + (N), 
and the state space is 

(3.178) ({1,2,---,A^} + (N))^«. 

The initial state is given by having only white particles, which are located at sites in {1, 2, • • • , N} 
such that occupation numbers are exchangeable on this part of the geographic space. 

The dynamics of the new system is markovian more precisely it is a pure Markov jump process 
and we specify the transitions and their rates. Instead of writing the generator we describe this 
more intuitively in words. This runs as follows: 
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white particles at a site follow the same loeal dynamies as the dual particle system i] as 
far as birth (of white particles) and coalescence (of white particles) goes, changes occur for 
migration. 

Let k denote the number of sites having currently at least one white particle. Each migrating 
white particle moves with probability 1 — to a new site in {1, 2, • • • , N} which prior to 
this event did not contain any white particles and with probability changes to a black 
particle now located at a new unoccupied site in N and at the same time also a red particle is 
produced at an occupied site in {1, 2, • • • , N} chosen at random among the k occupied sites. 

Hence a migrating white particle produces: 

k 

(3.179) a new site occupied with a white particle with probability (1 — — ), 
and two particles, 

k 

(3.180) one black and one red, with probability — , 



(3.181) 



the red particle is placed at a randomly chosen occupied site in {1, 2, • • • , N}, 
the black one at the smallest free site in N. 



Red particles have the same dynamics as the dual particle system on {1, 2, • • • , N} (newborn 
particles are also red) and in addition when a red and white at the same site coalesce the 
outcome is always white. 

Black particles follow the same dynamics as the white except that migrating black particles 
move on N and always go to a new, so far unoccupied site in N. 



This means that no collisions (by migration!) occur among white particles and furthermore by 
the collision convention the white particles are not influenced by the presence of red particles, nor 
are they influenced by the black particles. Hence the key observation about the new system is 
that: 

• The number of occupied sites in the union of the black and white particles follows the dynamics 
of the process without collisions, i.e. the number of sites they occupy is a version of (ii't)t>o, 



the number of occupied sites in the union of the white and red particles follows the exact 

)t>o- 



dual dynamics, i.e. the number of sites they occupy is producing a version of {K^)t 



The process of white and black particles per site follow the dynamics of C and the one of 
white and red particles per site that of C^- 



• We have a coupling of Q and C given by the embedding in the multicolour system and the 
difference process Ct ~ Ct^ can be represented as the number of black minus the number of 
red particles at the various sites. In particular also and K — can be represented 

in terms of the multicolour system. 

This construction allows to give upper and lower bounds on the time to reach with the dual 
process \sN\ occupied sites, which then in turn gives bounds on the number of factors in the dual 
process. 

Since we know from the last subsubsection how the growth of the population of both black and 
white particles works, we use this as an upper bound on the number of white particles (conditioned 
on W). 
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We denote the number of sites occupied only by black particles (recall " only" is by the construc- 
tion not a constraint), respectively the number of sites occupied by either white or black particles 
at time t by 

(3.182) , respectively 

Note that K is a version of K. 

We shall get below an upper bound on the number of sites occupied by black particles and then 
we can use this upper bound to get a lower bound on the number of sites occupied by white particles 
and therefore a lower bound for the number of factors of the dual (red plus white particles). 

Let 

(3.183) ?^(e) = inf{t : > e TV}. 
This means in particular that 

(3.184) {■) is nondecreasing. 

(Note the difference between {e) and t'^ (s) is that we use in the first case white and black 
particles and in the second case white and red particles. In addition K is a version of K.) 

We just saw above that we have to estimate the number of black particles to compare the dual 
and the dual of the McKean-Vlasov limit which differ by the difference of black and red particles. 
The latter is estimated above stochastically by the number of black particles. This holds since if 
we associate the black and red upon their creation, we see that the red ones have an extra chance 
to disappear once they coalesce with a white particles or collide with red particles from another 
black-red creation event. 

Step 3: Estimating the number of black particles. 

An upper bound on the number of black sites (recall black particles sit on N, while white and 
red sit on {1, • • • , TV}) at time r^(e), is obtained as follows. Bound the number of migration steps 
in the model by a Crump-Mode-Jaegers process which is in distribution given by the process of 
black and white particles. This means we construct up to terms of order o{N) a stochastic upper 
bound for the number of black particles by realizing independently a Poisson stream of potential 
collision events according to the law of a driving CMJ-proeess generating potential new sites and 
with a collision probability given by the current number of white sites divided by N. 

Therefore an upper bound on the production rates of new black populations is obtained by 
integrating the production rate of new sites generated by the process of white and black particles 
at a given time (which is for large times converging to a by p.l36p ). Furthermore note that in 
state k (meaning the number of sites with at least one white particle) with a migration step of 
a white particle, the probability of a collision and hence the probability to become black is k/N. 
This gives at time s with the CMJ-theory a production rate of collisions which is stochastically 
bounded by 

(3.185) a^^W.e"^ 

Hence we get a stochastic bound on the intensity of migration steps resulting in collisions of the 
form W^a{e°"^ /N)e"^ . This expression therefore bounds the rate of creation oi founders of new 
black families of particles. 

We can replace for s > Ti^gN, the quantity in the limit — > oo by W^, due to the 
convergence theorem for CMJ-processes. Therefore we can estimate the rate at which new black 
families are created asymptotically as TV — >■ oo and work with the expression 



(3.186) W^ae^^yN for s > n, 
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The contribution to the stream of production of cohisions arising before time riog n goes to zero in 
probabihty as — > oo and therefore the probabihty that black famihes are founded before time 
T\ogN tends to zero as ^ oo, more precisely at rate {log N)/N. 

In principle the time we consider, i.e. from tiq^n to TAr(e) seems random on first sight, however 
it is asymptotically deterministic. Namely tn{£) — TiogN becomes deterministic in the sense that 

1 eA^ 1 logiV 

(3.187) (fjv(e)-riogAr)-(-log( — )--log(-|— )) 0, a.s. 

so that conditioned on W as N ^ oo,tn{£) — TiogTV can be replaced by a deterministic quantity. 
This allows us to generate as upper bound on the number of black particles the following process. 

(1) Realise a Poisson point process with intensity measure 

(3.188) {W^ae'^°'yN)ds. 

(2) Then let from each point of this Poisson point process evolve independent families of black 
particles with the usual dynamics but independent of the Poisson point process. 

Next observe that a new black founding particle generated by the Poisson point process has 
a descendant black population which forms by construction a Crump-Mode- Jagers process with 
growth rate a, i.e. it grows like W^(?N(£)_^)e"'^'^ (e)-^)^ if g is the birth time of the black particle 
and provided we observe up to the final time T^(e). Here {lysjseR are independent copies of W. 

The newly founded black populations created at the Poisson point process jump times observed 
at time r/v(£) have sizes 

(3.189) {W^(?„(e)-.),jvexp(a(fA.(e)-,s)), e {sf , s^, • • • , ,sf,(^)}}, 

(we will suppress the subscript N for in formulas below) and are independently distributed 
conditioned on the process of birth-times and r/v (e) . For these copies we have a law of large numbers 

acting. The claim is more precisely that if we take the expectation E over {Wrf^(£)-si,Ni i G I^} 
then as TV ^ oo we have the following law of large numbers effect: 

M(N) "^"(^^ 

(3.190) ^ W^?„(e)-s.,wcxp(a(fjv(e)-5,,^))^ j aM^2(e2"7iV)^[W^(?„(,)_,),^]e"(^"(^)-^)ds 
and the r.h.s. is asymptotically as — > oo equal to 

(3.191) E[W] I aW^2(g2as/^)ga(?„(e)-s)^^^ 



using the convergence theorem for the Crump-Mode Jagers process. 

Then combining (|3.189p and p.l90p together with p.l9ip we obtain a mean conditioned on W 
of the l.h.s. of p.l90p which is asymptotically as A^ ^ oo equal to: 

(3.192) [ j £:[iy]aW^2e"^«(^) • e"«ds] - E[W]W^ ^{r^e"^ N"^) = E[W]e'^N. 

To justify the law of large numbers given in (|3.190p . we observe that the number of birth events 
of black populations goes to zero for times t n with 

log A^ 

(3.193) tN - - oo 

2a N^oo 
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and for these times the order of magnitude of a descending population is at most of order V^V 
o{N). Therefore we get a diverging number of contributions which are all o{N). 
More precisely, the law of large numbers is verified by showing that 

(3.194) Var m.„(.)-.),.e"(^"(^)-^)l = 0(^^) = 0(^°^^' 



Here we use the explicit representation and the fact that Var{Ws) < oo and Var{Ws) -> Var{W) 
as s oo together with the asymptotics of TN{e). This concludes the proof for the law of large 
numbers. 

Hence we can bound the number of black particles by the asymptotic growth in N of its 
expectation over the randomness in the process of growth, i.e. the W^n in the black population. 

This reasoning gives the following asymptotic stochastic upper bound on the number of black 
sites, namely the expression: 



(3.195) f ^ ^ a^e°"E[W,]e"'-^"'-''^-''Us - E[W]Ne^, as N ^ oo, 



using the upper bound on (e) implied by p.l43p . 

Therefore we conclude that the proportion of black sites among all sites in the black and white 
system at time (e) is at most (in the limit N — > oo) equal to 

In particular by letting e — ^ this relative frequency goes to 0. 

In particular this would imply that the calculation below of (e) (the time for the number of 
white plus red sites to reach e) will give a stochastic lower bound for 

(3.197) {e - const -e^), 

if we condition on W. 



Step 4: Upper bound on emergence time 

Recalling p.l76p we see that given the current state k of , the time of the next birth in 
is a positive random variable denoted for which we are waiting for the next migration step of 
a white or red particle to an unoccupied site. This waiting time is bounded above by the waiting 
time of a white particle migration step to an unoccupied site. 

Observe that due to the uniform migration distribution, the property to jump to an unoccupied 
site is an experiment independent of everything else and the jump is successful with probability 
1 — k/N if k is the current number of occupied sites. Therefore we can view the formation of new 
occupied colonies as a pruning of the point process of white particle migration steps. 

What can we say about the point process of the attempted migration steps by white particles? 
We note first that this migration step is given all current internal states the infimum of k, inde- 
pendent exponential waiting times, each of which is the minimum of exponential waiting times for 
the internal transitions like birth, death and emigration. Once an internal transition other than 
migration occurs, the experiment starts over with this new internal states. The rates of all this 
exponential waiting times depend on the internal state in the corresponding site. It is best to view 
this as trials consisting of migration steps arising from k waiting times built from exponentials 
according to the internal states, waiting for one which leads to a new colony. 

We know from the CMJ-theory that the number of migration steps in the white plus black 
system, with k occupied at time i, in the time interval [t, t + At] behaves asymptotically as akAt 
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as fc — > OO a.s. and in Li as long as k has not reached too large values. We now use the 
observation that the relative frequency of black colonies is asymptotically as iV — > oo negligible so 
that asymptotically the white migration steps are as frequent as the ones of the white and black 
system as long as k is large but k/N remains small. 

When the current number of occupied sites is k the waiting time for the next colonization has 
mean bounded above by that of the waiting time for the birth of white site. To obtain information 
on this random time we note that the next migration of a particle is given by an exponential random 
clock which occurs with a hazard function given by the rate c^jLj^ and the probability that 
a migration produces a new white site is 1 — (compare (|3.176p ) . To get an upperbound on the 
time Tiogjv we note that > 1 and note that the number of migration events needed to obtain 
a new white site is geometric with mean (1 — k/N)^^. 

Recall that Tiog n is given by (|3.173p . Then recalling the convergence to the stable age and size 
distribution for t ^ oo for fc — ^ oo and therefore ^ X]i=i Ci(') ~ cti argue below in detail that 
the expected time, given the internal states of the occupied sites to obtain the next white site for 
log N < k < e'N, behaves for — > oo as follows: 

1 1 _ 1 1 1 1 1 

Eti lc.>iC.(-))(l - #) ^ ka{l + + aN{l - e') " 

To make more precise the first step, note first that 

(3.199) lim (-^v E lc.>iC.(-)) = lim 

We know from the CMJ-theory that: 

OO OO 

(3.200) I lim Vfc[/^(TiogAr+,-,fc) - Vfc[/(cx),A:)| = 0. 



N- 

k=2 k=2 



Therefore 



(3.201) I ^lim ''^kU^ {Tiogis^^j,k) — a\ — uniformly in j. 



N- 

k=2 



N- 



Secondly we now have to deal with the fact that the quantity we handled above appears in the 
denominator and is random, so that taking expectation needs some care. Note that X]fc°=2 kU^ {riog 
j),k) > l-C/^(riogjv+,,l) and 

(3.202) lim (1 - [/'^(riogAT+j, 1)) > ^(1 - C/(c», 1)) > 0. 



We treat below the event (1 — (riog at+j , 1)) < a for small a as a rare event and obtain a large 
deviation bound for it allowing us to proceed for N ^ oo with assuming we are in the complement. 

We break the details of the argument down in two parts, namely, the probability that more 
than of the individuals have age less than or equal to B, that is, ^ logiV particles produce 

i logiV new particles in time < B. We choose B so that the expected number of particles produced 
by one site in time B is < ^. The second part is to calculate that more than a fraction (1 — a) of 
the sites of age > B have only a single particle where a > is chosen so that if a site is of age > B 
then P{( > 2) > 2a. Since the events that particles produce particles in time B and 

the event that more than a fraction (1 — a) sites of age B have C = 1 are both "rare" events, we 
can verify the exponential bound: 

(3.203) P[(l - U^inogN+j, 1)) <a]< e-c(i°s^+J) for large TV, for some C > 0. 
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Wc now take expectations conditioned to be on the good event, 

(3.204) E4-] - E[-\{1 - C/^(riogjv+j, 1)) > a for j = 1, . . . , eA^ - logiV]. 
We can then conclude (by bounded convergence) that: 

(3.205) lim ^4 ] — ] < -, uniformly in j. 



a 



This means that we now can conclude with Tk denoting the waiting time for the next migration 
step if we have k occupied sites satisfies: 

(3.206) E4n]<^,+^;j^^y ke[lo,N,s'N]. 

Remark 21 For the case considered in Section 8 it is convenient to note the following simple 
generalization. Consider a system of k independent exponential clocks Ci , . . . , Cfe with exponential 
waiting times with parameters Ci , . . . , Cfc . Then 

(3.207) P(minC, < ^) = (1 \lP{C^ > ^)) 

i 

= {l Yl e-'"'"''^) = 1 - e~** ^^=1 ask^oo. 

i 

In our case Ci = Ci G N and this leads to 1—e^^^k ^'^ (nagN+j,k)) ^ therefore mean 
and second moment 2 ( r^p^ — r-mvi rr I and so th 



1 



T-r^ — , riNi PT I o,na so trie variance is i t-v^ — ttttvt rr 

In order to conclude the argument we next choose e > 0. Then for sufficiently large N, the 
mean time, E^[T^{e)], for K^^ required to reach eN {ci. p.206p ) satisfies 

(3.208) E4r^{e)]< E[r,^,M + + ■ ■ ■ + ^^] 

1 JogiV 1 , e 



2 



a' N '" eN' Q'(l - e) ' 
Hence 



(3.209) E4T^ie)\W]<^^^ + cie), 

a 

where c{e) is a constant depending on e and W. 

We also have from the CMJ-theory the lower bound 

(3.210) E[r^{e)\W]>E[r^{e)\W]=a-HogN- (^U^g£]±i^^ . 

Therefore it follows that (e), conditioned on W, is of the form a^^ log + 0(1), as claimed. 
We also see that the difference between {e) and T^(e) is a random variable satisfying 

(3.211) £;,[|T^(e)-?^(£)|] <oo. 

We can also estimate now the time till fixation as follows. The coupling construction now 
implies that if we choose e small enough, then at time logN we will reach eN dual particles 
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which win then in finite random time produce a mutation jump so that we get indeed (jl.67l) and 
thus completing the argument. Namely since 

(3.212) nf > kt - , < r.h.s. (EUSD, 

after time T^(e) there is a mutation rate at least 

(3.213) m{e ~ E[W]e^)+ 

uniformly in large N and therefore the probability (recall T^(e) — 'r^(e) = 0{1) as — > oo) that 
a mutation does not occur before T^(e) + t goes to zero as t — >■ oo uniformly in TV if we choose e 
small enough. This completes the proof of (|1.67|) . 

Step 5: Extension: convergence of hitting times 

However we can get even more mileage from the above argument to make it clear that the 
randomness in the evolution of spatially macroscopic variables is created only in the very early 
stages of the growth of the dual particle system. 

We next estimate the variance of T^(e) — '''i^gjvi * random variable of which we know from the 
analysis of Step 4 above that it is asymptotically close to t^{£) — t^^n as TV — >■ oo and e — )■ 0. If 
we now condition on all the internal states of all occupied sites we can calculate the variance of the 
waiting time for the next migration step explicitly since we know it is exponential. Hence we 
can, denoting by Var{-) this conditional variance, estimate yar(r^(e) — Tj^g^y)- Conditioned on 
W and restricted to the complement of the rare event introduced in Step 4 (which is asymptotically 
negligible (|3.203|) ) we can verify by explicit calculation (following the same argument as for the 
means in (j3.208p now for the variances, in this case we get the sum of squares of the terms we had 
in the previous mean calculation) that with Var,, denoting this conditional variance restricted to 
the complement of the rare event, we get the bound 

(3.214) limsupyar*[(r^(£) - Tiogjv)] < limsup (const • ( ) ) = 0- 

N^oo Af->oo V V log IN eJ-N// 

Now we note that the term in the limsup on the r.h.s. is independent of the internal states and 
we conclude that conditional on W the (restricted) variance of the difference between T^(e) and 
TiogN goes to zero and hence that is, the variance of the sum conditioned on W of the waiting 
times (with means given in p.l98|) ) in the second and third time intervals given in p.l72|) goes to 
as ^ oo since the rare event (and hence the restriction) become negligible as iV — >■ oo. The 
analogous result also holds for (e) — riog n ■ 

This means that the growth in the second and third time interval we have specified in p.l72p 
is deterministic and the randomness occurred earlier in the first time interval. 

Note that during the first time interval as TV ^> oo the expected number of collisions is 
0(-fi2SiY}_) so that no collisions occur with probability tending to 1. Therefore as TV — > oo, 

(3.215) ?w AT - T-,^,. AT — ^ in probability. 

Together with the fact that as TV — >■ oo the limiting probability of the rare event is it follows that 
there exists a deterministic sequences, namely 

(3.216) {6% - - Ti^gwD^eN, = E^r^^ ~ t^^])^.^^ 
such that for r\ > 0, 

(3.217) P[|f^(e)-?,^g^-J^|>r;]^0, 
and 

(3.218) P[|T^(£)-<gA.-<5^|>r;]^0. 

We now collect the above estimates to prove the following. 
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Proposition 3.5 (Convergence of hitting times of level s) 

There exists a non- degenerate S.-valued random variable T(e) such that 

(3.219) f^(e)-i^^f(e), as N ^ oo, 

a 

(3.220) (s) — E,,[t^ (e)] converges in distribution 

and there exists a constants {C(£)}£>o such that limj_>.o C(e) = and 

(3.221) lim pf ^-SN + loge-logW ^ ^ 
N^oc \ a 



W] =1. □ 



logiV + loge-logW^ , . 

< h 20(£j 

a 

Proof We begin with some preparatory points. Note that asyniptoticahy as iV — >• oo by (|3.143p . 

\og\ogN -\0gWN 
3.222) Ti = o 1) 

^ a 

where Wn W a.s., and by p.215p it follows that 

(3.223) = 0(1). 

^ a 

But by (l3:2T7l) . (ISTHS)) . 

(3.224) PW{e) ~no,N- S%) (r^(e) - <^ - <5^)| > 2ry] ^ 0. 
But then (|3.21ip implies that 

(3.225) sup|?^ - (5^1 < oo. 

N 

We also have that % - ( log(jVs)-iogiogjv -^ Therefore as iV ^ oo: 

(3.226) ^A^(,)_l^^log^-l"g^. 

Having finished the preparation we already see that p.226p implies the weak convergence 
claimed in p.219p . Now we come to the proof of (|3.220p and p.22ip . 

The relation (|3.22ip then follows from using p.234p and inserting p.223p . p.222p together with 
p.225p and the fact that sup — 5j^\ tends to zero as e — ^ 0. In order to see now p.220p we 

N 

have to use that given W the (e) — Tj^ ^ is as iV ^ oo asymptotically deterministic as we saw 
above. Then the claim follows with (|3.223p . 



3.2.7 Dual process in the collision regime: nonlinear dynamics 

In order to obtain more detailed results on the emergence regime and on the process of fixation via 
the dual process we will need more precise information about the growth of the number of sites 
occupied by the dual process, respectively the states at these sites, once we reach the regime where 
collisions play a role. 

The behaviour in this regime is qualitatively very different from the collision-free situation and 
is the result of three properties of the evolution mechanism: 
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1. Randomness enters in early stages of the growth of the dual process, i.e. times in [0,0(1)], 
here the dual behaves as a CMJ branching process described by linear (i.e. Markov) but 
random dynamics, 

2. in the time regime 

log N 

(3.227) [s{N), -^), with s{N) ^ C30, s{N) = o(logiV) 

the law of large numbers is in effect leading to deterministic but still linear dynamics 

3. in the regime {s{N) as above) 

log log N 

the dynamics is deterministic but the evolution equation becomes nonlinear due to collisions. 

The problem is now that these regimes separate as N ^ oo and we have to connect them through a 
careful analysis. To integrate all three phases into a deterministic nonlinear dynamic with random 
initial condition, the full dual process for the finite system with N exchangeable sites is analysed. 

Indeed in this and the next subsubsections we show that in the limit N ^ oo the initial 
asymptotically collision-free evolution stage in (1) and (2) produces a random initial condition for 
the nonlinear dynamics arising in (3) where the randomness arises from stage (1). A key step in 
linking the random and nonlinear aspects is the analysis of the evolution in the time interval in 
p.228|) in the limit N oo, t^ -oo. 

In this subsection we consider the time interval 

(3.229) - [log TV -log log AT, log iV + T) 
a 

in which the dynamics are deterministic. Recall that 

(3.230) u^(-(log AT - loglog AT)) ^ Wnt^ = W-^ + o(l) as A^ oo, 

a logN log A* 

since Wn =^ W hy the equation (|3.143p . 

We have to describe the dual process in the above time interval in the limit N ^ oo and to 
obtain a limit dynamic which allows to draw the needed conclusions for the original process. We 
proceed in three steps. It turns out that the main properties of the dual process needed for that 
purpose can be captured in a triple of functionals of this process which we introduce in Step 1. In 
Step 2 we formulate the corresponding limiting (for A^ — >■ oo) objects which allows us to finally 
state and prove in Subsubsection [372.81 the key convergence relation for the dual process. Step 3 
proves that the defined evolutions have the desired properties. 

Step 1 Some functionals of the dual in the time span up to fixation 

First, we observe that to determine the quantity of interest for emergence namely 

Tl^ I, \oeN 
3.231 , for Tn = 

N a 

as a function of t we need to know the number of sites occupied at time Tn + t and relative 
frequencies of the occupation numbers of these sites. Since the occupation number of a site depends 
on the age of the site, we will keep track of the number of occupied sites and the relative frequencies 
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of both age and occupation number for these sites. As iV — > cx) a law of large number argument is 
then used to obtain the desired information. 

We consider the number of sites occupied at time t denoted and the corresponding measure- 
valued process on [0, oo) x Nq giving the unnormalized number of sites of a certain age u in [a, b) 
and occupation size j: 

(3.232) *^(t, [a,6),j)) - f^' 1(k->k- )l(c^ it)=j)dK^ , 

J{t-b) 

where {t) denotes the occupation number at time t of the site born at time u, that is, a site first 
occupied the last time at time u, which is therefore at time t exactly of age t — u. 

The normalized empirical age and size distribution among the occupied sites is defined as: 

(3.233) C/^(t,[a,&),j) = * > 0, j S {1, 2, 3, . . .}. 

Thus for {Ct^)t>o we have a growing number of different interacting birth and death pro- 
cesses which each have in state k linear birth rate sfc, linear death rate c(l — jf )k, quadratic death 
rate dk{k — 1). In addition immigration of one additional element into a site of size k occurs with 
rate 

(3 234) c |^ A'fc'H«jvW+7jv(^))-fc 

where aN{t) is c times the mean size of sites with more than one particle, is c times the 
frequency of singletons; these will be defined precisely below in p.239p . Note that U'^{t,dx,j) is 
a pure atomic measure which evolves with jumps of size jhr one for each migration event to a yet 

unoccupied site, or when a singly occupied site migrates and a collision occurs. 

Remark 22 Note that the Cu i') interact if we are in a regime where collisions occur, that is at 
times after emergence has occurred which happens at times of order . The marginal distribution 
of age is the analogue of the empirical age distribution in \3.16S\] . However now we take into 
account possible collisions, that is, a migration to a previously occupied site when we consider the 
system of N sites with symmetric migration among these sites and in this case the distribution of 
Cu i^) niay depend not only on the age t ^ u but also on t due to changes in the collision rates. 

Set for convenience to describe the pair given by the number of sites and size-age distribution: 

(3.235) u^it) := A'f . 

We have obtained with (|3.233p . (|3.235p now a pair which is N x P([0, oo) x N)-valued, denoted 

(3.236) (^i^(^),C/^(^,•,•))t>o 



and which completely describes our dual particle system up to permutations of sites. We can 
If 



recover the functional Ili^ as 



(3.237) nf = ^i^(^)5]Jt/^(^,[o,^],J). 

However since we work with exchangeable initial states for our original population process, the pair 
{u^ , U^) provides a sufliciently complete description of the dual process to allow us to calculate 
the laws of our original processes. 
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In the interval a ^[logA^ — log log iV, log + T] the process (M^(t))t>o increases by one, re- 
spectively decreases by one, at rates 

(3.238) aNit){l respectively 7jv(i)- 



where (t) , 7jv (t) are defined: 

t t 



These rates of change of {t, •, •) follow directly from the dynamics of the dual particle system t]. 

We note that if we start with k particles at each of £ different sites we obtain similar quantities, 
which we denote by 

i-o o/in\ f N,k,£ jjN,k,t\ 

(3.240) (ut ' )t>o. 

The dynamics of the the system 

(3.241) {u^{t),U^{t,ds,dx)) 

evolves in two main regimes corresponding to times o(log A^) and those of the form \ogN + t, 
with t = 0{1). They are asymptotically as A'^ — > oo linear and random for times of order o(log A^) 
as we have demonstrated in the previous subsubsection with the randomness captured by the 
random variable W given in p.l43p respectively p.230p above. On the other hand the dynamics 
become deterministic and nonlinear in the post-emergence regime, that is, for times of the form 
(Tjv 4-i)tg(_oo,cx)) where T/v = . This latter case involves the law of large numbers in the limit 
N oo and collisions play a decisive role. We will introduce the limiting dynamics in Step 2. We 
will later in Subsubsection 13.2.91 show how the two regimes are linked and that the first produces 
a random initial condition for the second, the nonlinear regime. 



Step 2 The limiting dynamics for dual in collision regime 
The second step is to define for k,£ £ N candidates 

(3.242) (nf u'^'^it), U'^'^it, ds, dx))tm and (C(O)teR 

for the limiting objects as A^ — > oo related to (here Tn = ct^^ log A^) 

■2f±i, ^f±i, U^^^^'^{Tn + t, ds, dx)y -oo < t < oo, 

{C^'('='^)(rAr 4-t,i), i^l,---.N}, -oo<i<oo, 

when the dual process initially has k particles at each of £ distinct sites at time t = 0. 
We now turn to the limiting objects as A^ — > oo which we denote 

(3.244) (^^^^i(^),^7(^))teK, (C'^'(i))t6K- 

We remark that it will turn out that conditioned on (nf^^, u'^'^{to), J7'^'^(io, ds, dx)), the dynam- 
ics of (nf u^'^{t), U^'^{t, ds, dx))t>to is independent of k, I. For this reason we often suppress the 
superscripts and later indicate how the initial conditions for k,l> \ are determined. 

Since 

oo 

(3.245) n,^'^ = u^^t) . . U^-\t,^+,3), 



(3.243) 
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and since the process C will depend on this pair from the triple in p.242p it suffices to specify 
the pair {u{t), t/(t))teR. This is done by first establishing that they satisfy a pair of deterministic 
equations and then determine the entrance law at —oo as follows. 
We first define two ingredients (parallel to (|3.239p ): 

oo C30 

(3.246) a{t) = a{U{t)) c J jU{t, ds, j), j{t) = 7(C/(i)) c J U{t, ds, 1). 



As before we first consider the evolution equation for times t > to and then we try to characterize 
an entrance law by considering — ^ — oo. For the moment, for notational convenience let to = 0. 
Then to specify 

(3.247) (u, U) = {u{t), U{t))t>o, with {u{t), U{t)) e M+ x Mi{R+ x N), 
we introduce the (coupled) system of nonlinear evolution equations: 

(3.248) ^ = a{t){l - u{t))u{t) - j{t)u^{t), 



, dU(t,dv,j) 

(3.249) -^f^ 

dU{t,dv,j) 
dv 

+s{j - l)lj^iU{t,dv,j - 1) - sjU{t,dv,j) 

+ + l)jU{t,dv,j + 1) - - l))l,^iU{t,dvJ) 
+c{j + l)U{t,dv,j + 1) - cjU{t,dv,j)lj^i 
-cu(t)U{t,dv,l)lj^i 

+u{t){a{t) + j{t))[l,^iUit,dv,j - 1) - U{t,dv,j]) 
+ ((1 - u{t))ait)) lj=i ■ Soidv) 

-(a(t)(l - u{t)) - 7(t)"(t)) • U{t, dv,j). 

Note that if U{0, ■, ■) is a probability measure on x N, then this property is preserved under 
the evolution. (Recall here that a and 7 contain the constant c of migration.) 

We note that the right side equation (|3.249|) is only defined in terms of A^fin(IR^ x N) elements, 
if we restrict to measures for which X^j J^ ' U{dv,j) < cxj, so we have to restrict to a subset of 
A^i(R+ X N). Define v as the measure on N given by 

(3.250) iy{j) = l+f. 

and let L^ii', N) denote the corresponding space of integrable functions. 
We can write these equations in compact form as 

(3.251) ^(^.(t), Uit, ., •)) = G*{u{t), U{t, ., •)), 
with the nonlinear operator 

(3.252) G* mapping R+ ® L\{R+ x N,iy) ^ M.+ (g) ¥, 

with F denoting the class of finite measurable functions / on x N. 
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If u{0) = 1*0 > 0, wc also work with the equivalent Alfin(Ili+ x N)-valucd process 

(3.253) $t := u(t) ■ U(t) 

which solves the nonlinear systems 

(3.254) -^^G*{^t) 

equivalent to (|3.25ip (note that it is easy to verify that if u{to) = 'i'to(M.+ x N) > 0, then u{t) > 
for all t > to). 

We think of this as a forward equation for a deterministic measure-valued process. In order 
to formulate the corresponding Markov semigroup generator we consider functions Fgj on (M-|- (g) 
L\(R+ X N, i/)) of the form 



(3.255) 




gi:R— )-M, / :[0,oo)xN— g has bounded first and second derivatives, 
/ bounded and has bounded first derivatives in the first variable. 

The algebra of functions, which contains all functions of the form p.255p is denoted 

(3.256) Do{G) 

and is a separating set in Ct, (A^ fin ( [0, oo) x No)) and is therefore distribution- determining for laws 
on 7Wfi„([0,oo) X No). 

Remark 23 In fact since we are concerned with 5* which are sub-probability measures we note 
that the integrals are bounded, we can include unbounded functions g defined on [0, 1] and in 
C^([0, 1],M). In particular, we can consider g{x) = x and g(x) = .t^. 

Then we introduce the semigroup 

(3.257) VtF,j{%o)--^Fgj{^{t)), 

where $(t) = u{t)U{t) and {u{t),U{t)) is the solution of the pair p.248p . p.249p with initial 
condition — uq ■ Uq- Let 
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Then with Fgj- as in (|3.255p we set 

GF,j{^)^g' ($(/)) ($(G*/(r,j))J 
= .9' I^ndr^dx) 

^ ^01 

oo oo 

11 sx [{fir, x + l)- f{r, x)))] *(dr, dx) 

1 

oo oo 

+.9'(*(/)) / / - 1) [/(r, x- - 1) - /(r, x))] $(dr, dx) 

1 

• c(l - $(!)) /,°° [/(0, 1) + /(r, X - 1) - /(r, a;)]xl,^i$(dr, da;)) 

^ ^ OO OO ^ 

+,g'(*(/))*(l)c / / [f{r, x + l)- f{r, x))] x^idr, dx) 

1 

/ CJO CJO 

+.9'($(/))c(/o°° /r '^^)) / / 2? - 1) - /(^ 5?))] dx) 



(3.259) 



1 



g'(*(/))-//G*/(r,x)$(dr,dx), 



with the obvious extension to Dq(G). 

We can then consider the martingale problem associated with (G, Do{G)), that is, to determine 
probability measures, P, on C([0, oo), 7Wfi„(R+ x N) n B) (where B is defined below in p.262p ) 
such that 



(3.260) Fgji^it)) - f\GFgj){^is))ds 
Jo 



is a P- martingale for all F G Dq{G). 

Since G is a first order operator any solution to this martingale problem is deterministic. To 
verify this consider g{x) = x'^ and g{x) = x and show that G(Fj.2) — 2FxGFx = 0. This holds 
by inspection of p.259p . This implies that the variance of the marginal distribution ^'t(/) for a 
solution to the martingale problem is 0. 

Then taking next g{x) — x wc have that any solution satisfies 

(3.261) $t(/) - /* $3(G*(*'/)rfs = *t(/) - /* G*^s{f)ds. 



Taking / = 1 and then indicator functions / = Ij wc obtain our nonlinear system. In other 
words any solution to the martingale problem is a deterministic trajectory satisfying the nonlinear 
dynamics p.248p . p.249p . This form will play a role later on when wc consider the convergence. 

In order to discuss the existence and uniqueness of solutions to these equations wc introduce a 
norm and reformulate the equations as a nonlinear evolution equation in a Banach space, namely, 

(3.262) i R(g) L^(M+ X N,ds X z^) 
which we furnish with the norm 

(3.263) = ||(u, (a,(-)).GN)|l = l^l + |l(a,;(-))»eN|li, 
where u = $(M x N), ai{-) = $(• x {i}) and 

oo 

(3.264) |l(a,(.)).eN|ii := Y.'^l + f)\a,{-)U, 
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where var denotes the total variation norm for measures on M^.. On this space the r.h.s. of the 
equation (|3.249p is well-defined. 

We note that the Markov property is satisfied by the equivalent systems {u{t),U{t,M., ■)) and 
^'(t) = u{t) ■ U{t,R, •). Since this all that is needed for the dual representation we often suppress 
the ages. In this case we use 

(3.265) B :=R®L^(N,i/) 
and the simpler norm 

oo 

(3.266) ||(u,(a,))»GN)|l = |u| + ||(az)zeN||i, where ||(aO^GN||l := ^(1 + /)|aj|. 

Existence of a solution to this system will follow from the convergence result below in Subsub- 
section [3.2.81 fProposition [3?8| . The question of uniqueness given the initial point (1^(^0)7 U{to)) is 
considered in Step 3 below. 

Return now to the finite A'^-system and imagine the collection of occupied sites in our dual 
particle process at a specific time t represented in the form {t, = 1, . . . , Nt- To complete 
the description we must specify what happens at a single tagged site for times r > t, i.e. to 
C^(r, i), i e N in the hmit N ^ 00. 

Given the function {u{t), U{t)) this will be given by the time-inhomogeneous birth, death and 
immigration process 

(3.267) {Cit,i),t> 0}, with transition probabilities {pfcj(t), (fc,j) € N^ t> 0}, 

given by the forward nonlinear Kolmogorov equation starting with one particle at site i at time 0, 
i.e., pi.i(O) = 1 and which reads as follows: 

(3.268) p;^^.(t) = (s(j-l)+a(t)u(t))pij_i(t) + ^(j + l)jpi,,+i(t)-((,s + c)i + ^i(j-l))pi,,(i) 
for J ^0,1, 

(3.269) p'i,o(t) = cu(t)pi,i{t) ~ a{t)u{t)pifl(t), 

(3.270) pl,{t) = d ■ pi^2{t) - ((s + a{t)u{t)))pi,i{t), 
where 

(3.271) u{t) = 1 -pi,o(t), a{t)u{t) = jpi,,(t). 

This process C(t, i) will describe the evolution of a tagged site i in the limiting system in the 
collision regime. 

Next, in order to prepare for the study of the nonlinear system p.248p . (|3.249p we consider a 
related set of equations for the non-collision regime obtained by setting w = 0, namely, 

(3.272) 

dU{t,dv,j) _ dU{t,dv,j) 
dt dv 

+s{j - l)lj^iU{t,dv,j - 1) - sjU{t,j) 

+ + l)jU{t,dv,j + 1) - - l))lj^iUit,dv,j) 

+cij + l)Uit,dv,j + 1) - cjUit,dv,j)lj^i 
+a{t)[So X Ij-^i - U{t,dv,j)], 
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where a{t) :— c^^2 Io° ^ ' ^{t, dv, £). 

Since the dual representation expressions only involve the marginals ?7(i, M-i-, j) and the jump 
rates depend on j but not the age of an occupied site it suffices to work with these marginal 
distributions we denote by U{t, j). We then work with the corresponding versions of (|3.248p . p.249p 
and (|3.272p and use the norm (|3.266p . 

We observe next that the solution to the nonlinear system p.272p can be obtained as 

(3.273) Uit,dv,j)= 



where {V(t^ •, •)} solve the /mear system which we spell out explicitly later on, namely it is obtained 
by deleting the last term in p.289p . The resulting linear equations coincide with the mean equations 
for the CMJ process and hence the convergence of V{t, •, •) as t — ?> oo to the stable size and age 
distribution follows from the CMJ theory. In addition we have that a(i) — ?> a as i — ?> oo, the 
Malthusian parameter of the CMJ process. 

We define a linear operator given by the infinite matrix 

(3.274) Q'^ * on L\{N, v) C Alfi„(N), 

obtained by first integrating over ages setting ct(t) = a on the right side of the (|3.272p . Note that 
the equation resulting this way from p.272p can be viewed as the forward equations for a Markov 
chain Z on N with transition matrix Q"'*. Let 

(3.275) (S°')t>s) = the dual semigroup on A^i(N) of this time-homogeneous Markov chain on N. 

To verify that the semigroup S*" corresponding to Q"'* is strongly continuous on (B, || • ||), note 
that for ^ G A^i(N), 

(3.276) lim WSty. - m|| = \y^\E\Z\t)\ - E\Z\m = 0, 

follows from elementary properties of the Markov chain (Z(t))t>o. 

Note that the generator of 5" is an unbounded operator but has domain containing 

oo 

(3.277) Do := {(a»)»eN e B : ^(1 +/)|a,| < oo} 

i=i 

and in fact this is a core for the semigroup ( |EK2| . Ch. 1, Prop. 3.3), since the birth and death 
process with linear birth and quadratic death rates has moments of all orders at positive times. 
(Note that the corresponding Markov process on N has an entrance law starting at +oo. This 
property is wellknown for the Kingman coalescent but extends to our birth and death process 
provided d > and is easily extended to include an additional immigration term.) 

The Markov chain associated with Q"'* has a unique equilibrium state C/°(oo). Let m(a) denote 
the mean equilibrium value 

oo 

(3.278) m(a) = ^j-[/''(c»,j). 

The equation m(a) = a has a unique positive solution and it is equal to a. Moreover S*" satisfies 

(3.279) lim (5f[/(0)) =Zi(oo), limsup |a(i) - a| = 0, 

where Uipo) = [/"(oo) is the unique equilibrium with mean a and is the stable size distribution 
for the CMJ process. 
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Step 3 Uniqueness and existence results 

We next show that the equations we gave in Step 2 uniquely determine the objects needed for 
our convergence resuhs. 

Lemma 3.6 (Uniqueness of the pair {u,U) and of C,) 

(a) The pair of equations \3.24-8^ - \3.249^ , for {u{t),U{t^M.^^ ■)) given an initial state from 
M+ X A^i(N) satisfying 

(3.280) (u(io), C/(to, •)) e [0, 1] ® Mi{n) n L\fi, v) 

at time to, has a unique solution {u{t), U{t))t>t„ with values in [0, 1] ® (A^i(N) H L?j_(N, v)). 

(b) Given {u,U) the process (C(0)t>to ^■^ uniquely determined by i3.268\) . 

(c) There exists a solution (w, U) with time parameter t G M for every A G (0,oo) with values 
m B = M ® L^{n, v), such that 

(3.281) u{t)e'-°'*- ^ A as t^ -oo, 

(3.282) U{t) — > U{oo), and hmsupe""*|aW - a| = S < oo. 

Here U{oo) is the stable age and size distribution of the CMJ-process corresponding to the particle 
process {Kt,C,t)t>Q given by the McKean-Vlasov dual process rj, which was defined in 113.166]) . 

(d) Given any solution {u,U) of equations \3.248^ - {3.2^9^ integrating out the age (given in 
{3.28'T^ - ^KM^) ) forteR with values in the space B := R (g) L^{N, v) satisfying 

(3.283) u{t) > 0, Umsupe'"*M(i) < oo, t^ \\U{t)\\ is bounded, 

> — oo 

has the following property of u 

(3.284) A^ Urn e^"*M(t) 

exists and the solution of {3.2^8^ - {3.249^ satisfying \3. 284\ l for given A is unique. Furthermore 
U{t) and a(t) satisfy 

(3.285) U{t) =^ U{oo) as t ^ -oo, and Umsupe""*|a(t) - a| < oo. 

i— > — oo 

(e) The solution of the .system including the age distribution, that is, 1^3.248^ and \3.249^ has 
a unique solution. □ 

The fohowing is immediate from (|3.280p and the discussion in p.272p - p.26ip . 
Corollary 3.7 The (G, Dq{G)) martingale problem is well-posed. 

Remark 24 Note that looking at the form of the equation we see that a solution indexed by M 
remains a solution if we make a time .shift. This corresponds to the different possible values for 
the growth constant A in \3.281\) . In particular the entrance law from at time — oo is unique up 
to the time shift. 

Proof of Lemma 13.61 

(a) First define 

(3.286) 6(0 = 1 + ^ 
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and wc express a{t) and j{t) in terms of U{t) and and consider L-^{i',N) as a basic space for the 
analysis of the component U(t, ■). Then we obtain a system of coupled differential equations for 
the pair (w, U): 

(3.287) ^^a{t){l~b{t)u{t))u{t), 



^^^^'^^ -- - in^UitJ - 1) - sjUitJ) 



dt 



+c(j + l)(7(t, j + 1) - cjU{t,j)lj^^ 
(3.288) + [a(t) - a{t)u{t) - j{t)u{t))] 

+u{t){a{t) + jmU{t,j - 1)1,^1 - U{t,j)] 

-(a(<)(l-^i(i))-7(0^(0) -UitJ). 

We now consider the related nonlinear system obtained by setting u = in p.288p , namely, 

dU{t,j) 



dt 



+s{j''L)h^iU{t,j-l)-sjU{t,j) 

+ i)m,j + 1) - ^ju - i))i,^,u{t,j) 



+c{j + WitJ + 1) - cjU{t,j)l,^i 
+a{t)lj=i-ait)-U{t,j). 

where a(t) = c ^-UitJ). 

We will consider the system (|3.288p as a nonlinear perturbation of the linear strongly continuous 
semigroup 5" on the Banach space B ~ L^{i',N) given in (|3.275p . 

Then we can rewrite p.287p and p.288p in the form (G* stands for adjoint of G) 

(3.289) ^(«(t), Uit, •)) = G*iuit), U{t, ■)) = (0, U{t, ■)Q^-*) + F{u{t), U{t, •)), 

where the infinite matrix Q"'* is the infinitesimal generator (Q-matrix) of the Markov semigroup of 
5'" and denote by U°''*{t) the state if S*" acts on measures and F : B — > B is a (locally) Lipschitz 
continuous function on B, namely, 

(3.290) F{u, U) = {a{U)u - {-jiU) + a{U))u^ , ((a - a{U)) + u{a{U) + 7([/))))Z(J7)) , 

where Z{U) = (^j)jeN, with Z.j = U{j - ^l,Zi = -1. 

Note that F involves a third degree polynomial in u and a{t). Moreover U a{U) is a 
linear function and differentiable on B. Therefore we can conclude that F is with first and 
second derivatives uniformly bounded on bounded subsets of B. Therefore we can use the result of 
Marsden jMA| . Theorem 4.17 ( see Appendix|31) to conclude that there is a unique flow {u{t), U{t)) 
satisfying the equations and in addition the mapping t — > (u(i), U{t)) and the mapping {ut„, Utg) — >■ 
{u{t), U{t)) are Lipschitz in B. 
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Remark 25 (Alternate proof of (a)) 

We write the equation ^3.244^ as an a system of ODE in the form 

(3.291) B'{t) = G*iB{t)) = L{B{t)) + N{Bit)), 

where L is a linear operator ( generator of a Markov process ) and N is a nonlinear operator on the 
Banach space B. The latter is locally Lipschitz hut not globally Lipschitz. Furthermore the operator 
L maps B into finite well-defined functions on but not into B. Only if we restrict to the smaller 
sets of configurations B, i.e. configurations b satisfying b E L^{N,i') and '^j^\{b{j)\ < oo we 
obtain Lb € B. This subset of configurations forms a dense subset in L^(N,iy). This means we 
cannot view this as a standard evolution equation in the Banach space B and have to bring in some 
extra information to make the usual arguments work. Namely we have to show that if we start 
in the set B we remain there; we get an evolution in the Banach space B. The extra information 
needed comes from the probabilistic interpretation. 
Indeed the evolution equation 

(3.292) B'{t) ^ Q*{B{t)) 

has a nice dual Markov semigroup (backward equation) with which we can work. Namely we know 
that the corresponding Markov process on N has an entrance law starting at +00. This property is 
wellknown for the Kingman coalescent but extends to our birth and death process provided d > 0. 
This is easily extended furthermore to an additional immigration term. 

Hence the evolution (forward equation) can be started with "_B(0) = doo" o,nd for all solution 
we obtain a smaller path in stochastic order, in fact the entrance law from infinity has the property 
that in particular ^ B{t,j){l + j^) < 00, for t > {). Therefore we can solve the evolution equation 

h3.292\) in B uniquely. This has to be extended to the nonlinear equations where again starting 
somewhere in B after any positive time we reach the stronger summability condition. 

However if we truncate L to L^ where L^x — l{j<fei,La;, we can view \3.244^ o,s a nonlinear 
evolution equation in a subset of the Banach space IR.(8) L^(N, v) (recall \3.263\) ). This means now 
we are in the set-up of ODE's in Banach spaces and can use the classical theory of existence and 
uniqueness for such equations to obtain unique solutions B^ . 

Then we return to the original equation observing that for the solution B^ of the truncated 
problem we have for initial states with ^ i?(0, j)(*)^ < 00 as /s — > 00 

i 

(3.293) [B'^it))' ^ B'{t), B^{t)~^B{t), t > 0, 

using the bound in stochastic order from the entrance law as pointed out above. 

If we have existence of the solution in i?"*" ® L^{H, u) for all t (where i^(N, u) denotes the set 
of sequences in L^{N,v) having non-negative components) it suffices to show local uniqueness. 

Turn first to the existence problem. We shall obtain in Proposition \3.8\ that we have convergence 
of the finite N systems to a solution of the equations and we need for the existence to know that it 
is in our Banach space. It can easily be verified that the solution obtained in the limit must lie in 

X L?|^(N, i^) and has no explosions using apriori estimates. (Recall that for C^{t,i) the second 
moments (in fact all moments) are finite over finite time intervals since this is true for the process 
without collision which provides upper bounds on , independently of N.) Using the fact that 
second moments remain bounded in bounded intervals the proof is standard. Therefore a solution 
arising as a limit for N ^ 00 lies in the subspace o/R"*" x A^i(N) where \\ ■ \\ is finite. This gives 
existence of a solution with values in the specified Banach space. 

We turn to the uniqueness. The form of the equation now immediately gives the local Lipschitz 
property of F. Therefore the solution is locally unique by the uniqueness result for nonlinear 
differential equations in Banach spaces (e.g. JPa^, Chapt. 6, Theorem 1.2) applied to the truncated 
problem and together with the approximation property we get uniqueness of our equation. 
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(b) It is standard to check that ( is for given (u, U) and for (|3.268p - (|3.270p a nonexplosive time- 
inhomogeneous Markov process which is uniquely determined by these equations. We have to show 
that the self-consistency relation p.27ip can be satisfied. This will follow from our convergence 
result for {u^ , U'^) below in Proposition I3.8[ Part (e). 

(c) To prove existence of an entrance law with the prescribed properties, we construct a solution 
as a limit of the sequence {un{t), Un{t))t£R,n S N defined as follows: 

(3.294) Unit) ^ Ae"*" for t < t„ 

Unit) =U{oo) ioi t <tn 

[un, Un) satisfy (|3.287p and p.288p for t > 

Standard arguments imply the tightness of the sequence ?7„(t))tgR, n € N} and that any 

limit point (u, U) satisfies (|3.287p and (|3.288[) . We now have to show that the limit points satisfy 
the conditions on the behaviour as t — > — oo. 

We observe that the solution to the equation for u = Q,U = U{oo) gives a stationary solution 
of the coupled pairs of equations. We know that the growth by v4e"*,< S R is an upper bound 
for the u-part of the limit, since this is the collision- free solution of the equation and therefore 
limsup(e~'^*u(t)) < A < oo. We therefore must verify that the limit point matches exactly that 

growth and starts from ZY(oo) at time t = — oo. We note that (u, U) must then agree with a 
stationary solution for u = 0, which is unique by standard Markov chain theory and hence the 
point at t = — oo must be Z//(oo). We next argue that a limit point (m, U) actually satisfies e~^*uit) 
converges to A a.s t ^ — oo. We shall see below that this would actually follow if u(i)e~'^* has for 
t — > — oo A as limes superior. If we know that the solution with this property is unique, then our 
limit point in the construction must then have the desired properties and in particular we must 
actually have convergence of Un) as u — > oo. This however we shall show below in (d). 

(d) Let {u,U) be a solution satisfying (|3.283p . Then there exists a sequence t„ — > — oo such 
that as n — >■ oo the fimit of Uitn)) exists. 

Now consider the nonlinear system (^(t), Uit)). We must verify that as i — > — oo we actually 
converge to the case where u = and ait) = a. First note that by p.l30p we can assume that 
a{t) is bounded for all t. By tightness we can find a sequence (tn)neN, in ^ oo as n — > oo and 
A £ (0, oo) such that as n — >■ oo, M(t„)e~"*" — >■ A. Since by assumption limsupj^_(^ e""*M(t) < oo 
it follows from the equations that \a{t) — a\ < const ■ e"*. Then any such solution must satisfy 

(3.295) u(i)- Ae"* =o(e"*). 

We next note that given u{t), a{t), 7(t), the equation for U (t) is linear and has a unique solution 
by the standard Markov chain theory. Therefore it suffices to prove that uit), ait), j{t) as functions 
on M are unique given the asymptotics behaviour of u as i — >■ — oo. Remember that «(<) and j/(t) 
are functions U{t). 

We next consider the leading asymptotic term for U. Let iS"{t))t>o the semigroup defined in 
(|3.275p and S""'* is action on measure, U"'* its path, ?7"'*(oo) its equilibrimir measure. Furthermore 
let L{-) = ai-) + 7(-)- Let t„, t^ -oo so that tn - tm oo. Then by (|3.244[) 

(3.296) U{tn)^ lim [^"•*(<„ - t,„)f^(<m) + / " 5"'*(i« - s)(a - a(s))Z(C/(s))ds 

+ J^" uis)S'''*itn-s)L{s)ds] 

= lim [5"'*(t„ - tm)Uit„,)] + f " 5"'*(i„ - s){a - a{s))Z{U{oo))ds 
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+ / Ae"" S'"(i„ - s){a + -f)Z{U°''*)ds + o(e"'") 



U{oo)+ / S°'-*{tn- s){a- a{s))Z{U{oo))ds 



+ / Ae"''S'"'*(t„-s)(a + 7)Z(;7"'*(oo))ds + o(e"*"). 



Therefore we have 

(3.297) Urn C/(t„) = Z^(oo). 

Note that then also a(t„) — > a and 7(i„) — >■ 7 as n — >■ cx), since these are continuum functionals of 
U(tn) and we have the integrabihty properties. 

Finally, assume that there are two solutions. We shall show below that if two such solutions 
Ui) and (u2, U2) were distinct a contradiction would result. Hence in particular in (c) we must 
have convergence as claimed. 

Let 

(3.298) («(t), Vit)) := (ui(t), C/i(t)) - (w2(i), C/2(t)), a(i) = ai(t) - ^2(0, lit) - 71 (0 - 72(t). 

^From the above (i.e. p.295p and using (a — a{s)) — o(e"*) in (|3.296p and estimating therein the 
third term by explicit bound) wc know that 

(3.299) \v{t,,)\ = o(e"*") and ||y(t„)j| = o(e"*"). 
Then 

dv{t) 



(3.300) 



dt 
dVjt) 
dt 

da{t) 
dt 



= av{t) + e"*a(t) - (a + 7)e"*i;(0 - e^'^\5 + 7), 

= V{t)Q"'* ~ aV{t) + A{a + 7)e"*Z((y(i))) + (a + 7) • v{t)Z{U''^*) 

+e"*((S + 7 + ^(^(t))), 
= -aS(t) + A{a + 7)e"*a(Z(T/(t))) + (a + 7) ' v{t)a{Z{U"'*)) + o(e"*), 



where a(t) = a(F(t)). 

We then obtain by inspection from p.300p that: 



(3.301) 



d{Ht)\ + \\vm) 

dt 



<a-(l+const-e"')(|v(t)| + ||V(t)i|). 



Then use the v equation and p.299p , more precisely we use the V equation to get an expression 
for a{t) and 7(t) as a linear hmction of v. to conclude v{t) = as follows. Namely we conclude 
using Gronwall's inequality, 

(3.302) |t;(i„)l + ll^(tn)|| < o(e"*'")e"(*"-*'"). 
Letting <,„ — > —00 we obtain 

(3.303) \vitn)\ + \\V{tn)\\^0. 

This completes the proof of uniqueness of (u(i), U{t)). 

(e) It remains to show that not only f/(t, M+, •) is unique but that this holds also for C/(t, •, •). 
Note that our result already implies that u{-) is uniquely determined. Note also a{t) and 7(t) 
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depend only on U integrated over the age and are continuous functions of t and are therefore 
also uniquely determined. Therefore ti(-), a(-), 7(-) are uniquely determined. Hence we can insert 
the unique objects as external (time-inhomogeneous) input into the equation. Then the equation 
p.249|) for given u, a, 7 is the forward equation of a time inhomogeneous Markov process with 
state space Mi([0, 00) x N) restricted to elements in R (g) i^(N, ly), which has a unique solution by 
standard arguments. 

This completes the proof of Lemma 13.61 



3.2.8 The dual process in the collision regime: convergence results 

Now we are ready to state the limit theorem for the growth dynamics of the dual population in 
the critical time scale where collisions are essential, with the key result for the application to the 
original process given in e) of Proposition [3781 below, see in particular (j3.320p . As usual we denote 
by =^ convergence in law. 

Consider the dual process which starts with k particles at each of £ distinct sites and denote the 
corresponding functionals of the dual process by (11^''^'^, u^'''^^, U^''''^)s>o- In order to consider 
them in the time scale in which fixation occurs, we introduce: 

(3.304) u'^'^^'it) = +t)yO),te (-00, T], = i 

a a 

(3.305) [7^.M(i)^[/^,M((i^+i)vO), te (-oo,T], [7^^M(_i^) ^ . ^ 

a a 

Furthermore we need the standard solution of the nonlinear system p.248p and (|3.249p , (see Lemma 
13.61 part (c)) denoted 

(3.306) iu*{t),U*{t,;-)))t>t„. 

which is the solution satisfying (recall Uipo) was the stable age and size distribution) 

(3.307) lim e""*u*(i) = 1, lim U*{t)^U{oo). 

Finally we have to specify three time scales. Let stand for some element of M and let to{N) and 
s{N) be as follows: 

loE N 

(3.308) to{N) = siN) - 

a 

with 

(3.309) s{N) 00, s{N) = o{\ogN). 

RecaU the time shift in (|3.304p and p.305p . 

The behaviour of the dual particle system is asymptotically as A'^ — > 00 as follows. 

Proposition 3.8 (Growth of dual population in the critical time scale in the N 00 limit) 

(a) Assume that for some to ^ M. as N ^ 00, {-^u'^''''^{to), U^'^'^ito)) converges in law to the 

pair [u[to) {U))) automatically contained in [0, 00) x Li(N, z^). 
Then as N ^ 00 



(3.310) C 



CMt),U{t,;-)))t^,,]^ 



in law on pathspace, where the r.h.s. is supported on the solution of the nonlinear system ^3.248^ 
and {8.249^ corresponding to the initial state {u{tQ),U{tQ)). (Note that the mechanism of the limit 
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dynamics does not depend on k or £, but the state at to will. That is why we suppress k,£ on the 
r.h.s.) 

(b) The collection {{u^'^'^ ^U^'^'^),N £ N} can be constructed on a common probability space 
such that for t(){N) as in LS. 308\) we have: 

(3.311) {e-''''^^^u^''''\toiN)),U^''''\to{N))) -> {W'''\U{oo)), a.s., as N oo. 

(c) The scaled occupation density converges (recall \3. 304\ l and iS. 305\] and ^3.307\ l): 

~N k I 

(3.312) ^^(t) => u''''{t), t G (-CX3, T], 

in the sense that for e > (with tQ{N) as in i3. 308]} }, 

( u'^.kj \ 

(3.313) P\ sup e-"*|— — (i)-u'=''^(t)| >e ^0 asiV^cx). 

\to{N)<t<T N J 

Also the age-size distribution of the dual converges: 

(3.314) U^'-'^-'it, •) =^ U^-\t, ; ■), t e (-0O, T], 

in the sense that for rj > 0, with \\ ■ \\ denoting the variational norm, 

(3.315) lim P( sup \\U^-''-^{t,-,-)-U'''^{t,-r)\\>v] =0. 



N 



to{N)<t<T 



(d) The limits (u'=^^(t),C/'='^(t, •,-)))t>to, can be represented as the unique solution of the non- 
linear system \3.248^ and \3.249^ satisfying 

(3.316) lim e~"*u'='^(t) lim U^^\t) ^ U{oo), 



with W^'^ having the law of the random variable appearing as the scaling (by e limit for t — > oo 
of the CMJ-process ' started with k particles at each of i sites. 

These solutions are random time shifts of the standard solution 113.306]} . ^3.307\ }, namely: 

(3.317) u'^\t)=u*it+'^^^), U'^\t) = U*it+'-^^^). 



(e) Let {Ll^'^'^)u>o denote the number of dual particles and Tjv = a ^logN and to{N), s{N) 
as in i3.308\) . Then for each k eNJ eN we have: 

(3.318) L\{N ^Tl,+f ^o(N) <t<oo}\=^ C[{ }tm]], 

(3.319) C[{N-^Il^^''^^ , < u < s(7V))] =^ So, 

with the function on R"*", which is identically zero. 

Furthermore for every i S M, there exists (a deterministic) i'k,i{t) G 7^([0, oo)) such that 



(3.320) £[1 / n^'^'^H =^ t.,,,(t). □ 
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Remark 26 Note that the dual at time T describes the genealogy of a typical sample from the 
original population drawn at the time T . Here time runs backwards, in particular the evolution of 
the dual with total time horizon T = \ogN + t in the time described by varying t corresponds 
in the genealogy of the tagged sample to the early moments, while the emergence phenomena of the 
dual population ( arising collisions ) reflect in the original population later times, where law of large 
number effects rule the further expansion of the fitter type on macroscopic scale (i.e. observing the 
complete space). 

In the case in which the rare mutant has succeeded our result tells us that we have a situation 
where the mutations which occur very early in the typical ancestral path have generated 0{N) 
possible choices for a fitter type to occur and to then prevail at the observation time at a fixed 
observation site. The randomness enters since these very early mutations during the first moments 
of the evolution have in the limit iV — >■ oo a Poisson structure. This produces a random time shift 
of emergence and takeover which then follows a deterministic track. This is the global picture, if 
we look at it locally we find that also a collection of tagged sites follows a random evolution in small 
time scale. 



Proof of Proposition! 

We begin with some preparations needed for all of the proofs of the different claims of the 
proposition, this is part 0, followed by three further parts explained at the end of part 0. 
Part (Preparation) 

For the proof we represent the process differently, namely we rewrite the problem in measure- 
valued form. We will need the following functional of the process 11^, denoted respectively 
we indicate the initial state in the notation (here Si = ): 



(3.321) U^ = Y,CAt)l{CAt)>l) 



which counts at time t only those dual particles which arc not the only ones at their site, i.e. they 
are the ones which can generate new occupied sites. Here C^{t) is the single site birth and death 
process starting at time s, the time where the site was first occupied, with mean-field immigration 

at time t given by c—j^ o(l)asA^— >cxd from the other sites. 

Note that we have now made the iV-dependence explicit. Also note that by definition of , 
and we have 



(3.322) n^i=w^(t)/ yjU''{t,ds,j). 



i=2 

To continue we introduce a new random object from which we can read off {u^ , U^) by taking 
suitable functions. Namely we consider 'i>^{t, ds, dx) the A^fin([0, oo) xNq)- valued random variable 
(finite measures on [0,oo) x Nq) defined by 

(3.323) ^^it,ds,dx) - ^t-su-{t))^ 

Si<t 

where {si} (recall Si = sf) denote the times of birth of new sites and i runs from 1 to (this 
set of times depends on N). Note that here empty sites are not counted. We know from the 
CMJ-theory that *^(t,-,-) is bounded by VF*e"*. Since we are interested in times a ^ log N + r 
we introduce the scaled object: 

(3.324) ^f = ^±Ll. 
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Note that for fixed N this is an element of Alfi„([0, oo) x No). 

Next note that indeed we obtain our pair {u^ , U^) as a functional of Observe: 



(3.325) u^(t) ^-f ([0,oo) X N), U^{t,-,- 
and the stochastic integral below gives: 

/•t /'t /'OO 

(3.326) n^= C{t)dK^= / a:M'f(ds,dx). 

Jo Jo Jo 

Therefore from p.325p we obtain the convergence of {N~^u^ , U^) in the time scale a^^ log N+t 
with t G M if we show the convergence of This will also allow us later on to argue that the 
mass and time scaled process 



(3.327) Ii^{t)=N-^nt,N 



+t 



converges as A'^ — >■ oo to a limit. 

The proof is broken into three main parts. The first two parts concern the pair (w, U) the third 
one the quantity N~^Il^^J^f.. The parts can be outlined as follows (where (a), • • • refers to the 
parts of the proposition): 

1. Assuming the marginals of {N^^u^ ,U^) converge in distribution at t = to, prove (a) that 
the limiting evolution during t > to follows the deterministic nonlinear dynamics specified in 
the system of coupled equations p.248p . p.249p and that pathwise convergence holds. 

2. Then prove (b) that the marginal distributions converge for to ~ to{N) ~ ^ log + s, s G R 
and show (c), (d) on the corresponding entrance behaviour. 

3. Show the convergence of the process {N^^Il^^^^,t E M} and of its time integrals to complete 
the proof of (e). 



Part 1 (Convergence of dynamics of ^t^) 

Let e 'P(L'([<o,oo),7Wfin(M+ x N))) denote the law of the Markov process 
(3.328) {$^?„+J,>,^ (recall (0321) with = 

In this part we assume that the marginal distributions Pj^ converge as — > oo. Our next goal 
is then to show that the processes converge in law on path space, that is, P^ ^ P as N ^ oo 
where P is the law of a deterministic dynamics. 

Recall that we have defined the candidate for the limiting object by a nonlinear equation, which 
we can view as forward equation for a nonlinear Markov process on R+ x N. This means we want 
to show that if we consider the marginal laws, that is, elements P^ of ■p(A4fin(K+ x N)) they 
converge to an element (5^^ with 'i't S 7Mfin(]R+ x N). We will now take the viewpoint of backward 
equations and their generators, that is equations for expectations of measures rather than actual 
weights. 

In order to do so in this present Part 1, we proceed as follows. 

• we obtain moment bounds on "fTiv+t uniform for iV e N and to < t < to + T , 

• we formulate the martingale problem for with generator G^v given by a linear operator 
on the space Cb{Mfin{^+ x Nq))) with common domain V given by a certain dense subset 
of Cb(A^fi„(M+ xNo)), 
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• we verify the tightness of the laws P , 

• we prove the weak convergence of P^ to P by showing that Gjv converges pointwise to G 
on these functions in the domain V where G is the operator defined in p.259p . 

• verify that hmit points of (P^)jveN are characterized as solutions to the G martingale prob- 
lem, 

• conclude the convergence since we have proved that the G martingale problem has a unique 
solution given by the nonlinear system (|3.248|) . p.249p . 

• the convergence for fixed to is obtained later in Part 2. 
Abbreviate for bounded measureable test functions / on [0, oo) x No 



(3.329) *(/) 



f{r,x)'^{dr,dx) 



oo 



and in particular ^'(1) = 4'([0,oo) x (No)). 

The process is a Markov process with state space A^fin([0, oo) x N). We calculate the action 
of the generator Gat acting on the domain V of functions Fgj of the form (I3.255P as follows. Let 
$ e 7Wfi„([0,oo) X N). Then: 



(3.330) 



{GNFgjm = 

c(i-$(i))(rrA^[.9($(/)4 

+ 119' (Hf))^Wr.dx) 



/(0,l)+/(r,.T-l)-/(r,x) N 
N 



)-g(*(/)) xl,^i*(dr,d.T) 



1 

oo oo 



J JsxN gi^if) 



1 

oo oo 



f{r,x + l)-f{r,x) ^ 
N I 



+ JJlx{x~l)N g(vl>(/)+ /(''-^-iW(^.^) )_g(^(j)) ^^dr,dx) 



OO OO OO OO 



-C^WIIIJN 5(*(/) 
10 1 '- 



f{f,x+l)-f{r,x) 
N 



x'9{dr, dx) 



•i'{df,dx) 
*(1) ■ 



Note that in order for the right side to be well-defined we require / x'^'^{M., dx) < oo. This condition 
is automatically satisfied if G Alfin([0,oo) x No) H B. This means that GNFgj is a wclldefined 
function on B, where it is even continuous. Note however Cb(B, R) is not mapped in Ch(B, M). This 
means we now have to investigate next that under our evolution we stay with being B. Therefore 
we turn next to bounds on the empirical moments of the number of particles at the occupied sites. 

We now establish some apriori bounds which will guarantee that the empirical moments remain 
stochastically bounded over the considered time interval. By definition we have 



(3.331) 



u'\TN+t) oo 



N 



fe=i 



Then we have with Si denoting the time of birth of the z-th colonized site (recall Si 
letting 5* be the birth time of a randomly picked site: 



) and 
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(3.332) 



E 



ti"(Tjv+t) oo 
i=l fc=l 



E 



u"(T„+t) 

T7 E (ci!r(7^^+o 



< 



and by Cauchy-Schwarz applied to the m-th process of Q we get 



(3.333) 



E 



«"(Tjv+t) 



\ i=l J i=l 



We can obtain an upper bound for x™'^^^^^{R,dx) by comparing with the coUision free 
case (recaU construction in Step 2 of Subsubscction 13.2.61 that is, the total number of particles is 
bounded by the collision free case) to obtain 



(3.334) 



hm P 



sup / x'i! 



to<t<to+T Jo 

MJv(Tjv+t) 



TN+ti 



< lim P 



, dx) > L 



N 



i=l 



to<t<to+T 



< ^-^ E b^Pto<t<t„+T{Cs{t)f] ^ 
~ L— i-oo 

using the independence of the {Cs} in the collision-free case and the martingale problem together 
with a standard martingale inequality for the birth and death process Cs{-)- 

We have seen that the first empirical moment bound p.334p holds by comparison with the 
collision-free regime. We now have to establish analogous results for higher empirical moments 
and incorporating collisions which involves additional immigration from the other colonies at each 
occupied site. Let K > 0. Let 

(3.335) C.'^'it) 

denote the birth and death process with immigration at rate K which starts at time Si. This process 
is ergodic and has a unique equilibrium. Let mn{K) denotes the corresponding n-th equilibrium 
moment. 

Remark 27 We used above the fact that the birth and death process with linear birth and quadratic 
death rates has moments of all orders at positive times. This can be verified by comparing with 
a subcritical branching process with immigration. Moreover the corresponding Markov process on 
N has an entrance law starting at +oo. This property is wellknown for the Kingman coalescent 
but extends to our birth and death process provided d > and is easily extended to include an 
additional immigration term. 
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Then up to the time 



(3.336) TK :== mf{t 



Tn + 



tiR,dx)>K} 



we can verify with a stochastic niontonicity argument that we get a bound uniformly in N: 
(3.337) E[{C^'^ {■))''] < mn{K) < oo, for ji,N e N. 
Now recall that (writing F^.f if g{x) — x) 



(3.338) ^^^+,{f) - ^t+toif) ' 



TN+t 



GNF,ji^^^+M.dx))ds 



is a martingale. Using the above moment inequalities, (see p.333p ). we can show that this is an 
martingale and we can then use a martingale inequality to show that for / with finite support 



(3.339) 



lim supP( sup *T„+t 

N to<t<{ta+T)ATK 



if) > L) 



< lim sup — { E 



o(l/l) 



Tn+T 
Tjv+to 



GNF,j{%^^^M,dx))\ds 



+ lim sup— £;[(M(^^^.^)^,^)^] 
= 0. 

For every n > 2 we can now choose a sequence (/m)meN with finite support in 
fm{x) t 2;" verify that for given K 



V) and let 



(3.340) lim supP 

L— f oo ^ 



sup 

ta<t<{to+T)^TK -Jo 



Tjv+t( 



^, dx) > L 



0. 



We next note that equation p.334p implies that 

(3.341) t oo as A' oo 

uniformly in N. This together with p.340p implies that the evolutions {$f , t e [Tn + to, Tn + 
to + T]}, N € N and their weak limit points are concentrated on path with values in B. 

We can conclude now with noting that GN{Fg_f) is a finite and even continuous function on 
B, even though Gjy docs not map functions in its domain C Cf,(B,]R) into Cb(B,R) but only into 
the set C(B,R) of continuous functions on B. Therefore we will have to show first that for the 
starting points of the evolution we work with, namely the processes {'i>^{t),t S [to,r]} and the 
limit points of their laws are concentrated on states satisfying this extra condition to be in B. This 
is now immediate from the a priori bound p.340p . 

We now verify first tightness in path space and then in a further step convergence of the 
distribution. 

Tightness We verify the tightness in path space that is the tightness of the laws 

(3.342) {P^ = /:[($f )t>tj; iV e N} as iV oo, 

assuming convergence of the marginals at time tg- Here we deal with a sequence of probability 
measure-valued (on a Polish space) jump processes. With the help of Jakubowski's criterion, 
(see Theorem 3.6.4 in [D]) we obtain that it suffices to consider the tightness of laws of real 
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valued scmimartingales, namely of {C[{Fgj{'i^))t>o], admissible / and g}. In fact g{x) ~ x and 
g{x) ~ x"^ (recall Remark [25)1 alone suffice according to that theorem and for these two functions 
g and all / with < / < 1 we shall now verify the criterion. 

Recall first the apriori bound (|3.340p . The tightness of the laws of Fgj o vj/^ is is obtained 
using the Joffe-Metivier criterion ( |JM| ) in the form of Corollary 3.6.7. in [D], applied to Fgj for 
g{x) = X and g{x) = x^. The bound p.340[) verifies the conditions therein. This means that now 
{£[(^y^_(_j)t>to]: ^ G N} has limit points in the set of laws on Skorohod space. 

Let denote the law of {$^}t>to on the Skorohod space D([0, oo), 7Wfin(K+ x N) n B) 

with initial values ^Pt,, G 7Mfin(K_|_ x N) n B. By tightness we can choose a convergent 

subsequence. Since the largest jumps decrease to the limit measure is automatically concentrated 
on C([0,(X)), A^fin(ffi+ X N) n 1) (see [EK2] . Chap.3, Theorem 10.2). Convergence in law on the 
Skorohod space follows if any limit point satisfies a martingale problem with a unique solution. 

Convergence Next we turn to the convergence in f.d.d. by establishing generator convergence. 
For this purpose we consider A^fin(K+ x N) n B as the state space for the processes 4'Ar(-) and the 
function space Cf,(A^fin(R+ x N) n B,R) as the basic space for our semigroup action. Recall the 
algebra of functions Do{G) given in p.259p . The operator Gat is a linear map from Dq{G) into 
C(Xfin(K+ X N)n B,R). 

We consider the convergence for two choices for the "initial state" at some time to = to{N), 
namely we assume that ^I^^ converges as — > oo, but either 

(3.343) $i^(l)^0 or $^(1) ^ a; > 0. 

In the first case we would be back in the linear regime (no collisions) we discussed in Subsub- 
section l3.2.7l and which we have identified in terms of the CMJ process. Next we consider the limit 
corresponding to initial points at some time to such that ^'^(to) does not converge to the zero 
measure. 

Let / > 0, ll/lloo < 1, $(1) < 1, 3 with ,g|([o,i],R) € ^^^([0 ,1],^) and Fgj as in (IX^ . 
Recall the definition of the operator G on Do{G) given in p.259p . Then there exists a constant 
(const) depending only on c, d, s such that: 

const r°° ^ const ^ 

(3.344) |G^(F,,^-)W-G(^^,j)(vI/)|<^|!g"l[o,i]||ooy^ x'^{dr,dx) < ^IMm. 

We get that for all Fgj of the form in (|3.255p with the above extra restrictions that 

(3.345) G^iFgjm G{Fgjm, 

provided that the argument ^ satisfies 

nOO poo 

(3.346) / / x^%{dr, dx) <oo, 
Jo Jo 

which is the case for e B. More precisely, considering the functions G^iFg), G{Fg) on any finite 
ball in B, we have 

(3.347) GNiFgj) ^GiFgj), iV ^ oo 

pointwise uniformly on such balls. 

From ()3.347p and the convergence of the initial laws we want to conclude that weak limit points 
of C[{'i>^)t>to] conditioned on a particular initial state at time to are (5-measures on the space of 
A^fin(lR~'' X N)-valued path which must satisfy the evolution equation 



(3.348) A($,) = G*$t , t>to. 
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For this we have to show 

1. the variance process of < / > converges to zero, and 

2. remains in large enough baUs with high probabihty such that p.347p can be apphed, 

3. making the choice g{x) = x in p.255p . to verify that the path solves equation p.348p . 

4. to prove convergence it then suffices to show that the G-martingale problem is wellposed. 
For the point (f ) consider g{x) = x^ and g{x) = x and show that 

(3.349) lim [GNiF^^j) - 2F,jGnF,,j] - 0. 

Note that replacing Gat by G the relation says that G is a first order operator and this is read 
of from the form of G directly. This can be verified also directly using p.259p and (|3.344p . This 
implies that the variance of a solution to the limiting martingale problem is 0. Since the higher 
moments of (^'^, /) are bounded this implies (1). Point (2) is given by the apriori estimate (|3.340p . 

For point (3) we take g{x) = x and conclude that the limit point is given by the unique 
trajectory in B which satisfies 

(3.350) / / f{r,x)^t{dr,dx)- / / G'^^f{r,x)%s{dr,dx)ds = 0, 
Jo Jo Jo Jo Jo 

that is, the nonlinear system p.254p (recall Lemma 13. 6p . In other words, any limit point is a 
solution to the G martingale problem. For point (4), we have established the uniqueness and 
properties of the solution to the G martingale problem in Lemma 13.61 and Corollary 13.71 

To complete the proof of convergence, given "i/to and T, find A'o > such that the solution of 
the nonlinear system stays in the interior of the ball of radius Kq during the interval [to, to + T] 
and let K > 2Ko, say. Then it follows from the above that we have weak convergence up to time 
{to +T) Atk for every K > and therefore for e > 

(3.351) lim P[ sup - $t|| > e] = 0. 

N^oo to<t<to+T 

This completes the proof of part (a) of the proposition. 

Part 2 Convergence of to-marginals and properties of {u^ ,U^) 

Our goal is to prove part (b) and (c) of the proposition under the assumptions of convergence 
at a fixed time \ogN + to- 
Proof of (1) 

For that purpose we prove first the convergence of the marginals at times logN+to, starting 
with "<o = — oo" , (|3.31ip . and then later consider G to finally come to the convergence of the 
path. 

Proof of (|3.31ip Here we have a time scale where as TV — J- oo the collisions become negligible 
since the observation time diverges sublogarithmically as function of N . Using the multicolour par- 
ticle system (see Subsubsection we can build all variables on one probability space. From this 
construction it follows that collisions become negligible and we have the claimed convergence from 
the one for the collision- free system proved in Subsubsection [572.41 see (|3.143p . and Subsubsection 
13.2.51 see (|3.166p via the CMJ-theory. This proves part (b) of the proposition. 

Proof of (c). We now have to prove the convergence result for the marginal of ('I'^)tgR at 
times t = log N + to, where we can choose io G 1^ as small as we want since we know that the 
dynamic converges to a continuous limit dynamic. This means we have to show (|3.312p . (|3.313p 
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and p.314p . Finally we have to relate the limit of {N~-^u^ ,U^) as — ^ oo to the system of 
nonlinear equations to then prove (|3.316p and p.317p . 

Proof of (|3.312|) and (|3.313p We recall that we proved in the previous part the convergence 
of N~^u^ pathwise if we consider path for t > to and if we have convergence in law at t = to. 
Therefore it suffices now to show that N^^u^ (to) converges (in law) as — > oo. 

To do this we first note that the time-scaled and normalized (i.e. scaled by A^~^) number 
of sites occupied at time a^^logA^ + to, denoted N^^u^ , does not go to zero as iV — >• oo nor 
does it become unbounded. This was proved in Proposition II. 71 This proves tightness of the one- 
dimensional marginal distributions at times a^^ log N + t and that limit points are non-degenerate. 

The tightness gives us the existence of limiting laws for {N'-^u^ ,U^) along subsequences for 
time \ogN + s for fixed s. Since N^^u^ < We"^ any limit point must satisfy 

(3.352) limsupe""*u(t) < oo and \\U{t)\\ bounded. 



Therefore by Lcmma l3.6r d) any limit point corresponds to a solution to the nonlinear system given 
by (|3.28ip and p.282p and any two such solutions are (random) time shifts of each other. Hence 
we have to show that each pair of limit prints belongs to the same time shift from the standard 
solution of the system with growth constant 1 . 

Recall the definitions of r^(e) (which is the first time where u^''''^ reaches [sN\ for all e S 
(0,1)), (e) from (|3.183p . (|3.184p . Now assume that there are two limit points and let the 
corresponding limiting passage times be denoted t^{s) and T^{e) such that T^"i''^{e) — >■ r*(£), i = 
1,2 in law. But by (UMI, P[\T^"--''^ie) - T^"i'^{e)\ > 2C{e)] ^ 0. But since C(e) ^ as e ^ 0, 
the time shift must be arbitrarily small, which means equal to zero and therefore the two limit 
points coincide. This immediately proves that T^(e) converges for any e > 0. 

We now elaborate on this argument. To show convergence of the marginal distributions it 
suffices to prove the convergence in law of the normalized first passage times T^rin(e) := (s) — 
{a~-^ logiV). We give two arguments which verify that convergence of the r^ormi^) is sufficient. 

(i) Given the value u{s) we know exactly the first passage time for all e > u{s) (due to the 
unique and also deterministic evolution from any initial point). Indeed using the fact that the path 
converges (for convergent sequences at a fixed time) , the collection is monotone non-decreasing and 
cadlag, we know that the whole collection (in e) of first passage time converges in path space. The 
uniqueness of the limit points follows since two different limiting distributions at times log N+s 
for fixed s would not be compatible with the uniqueness in distribution of the limiting first passage 
times, since the limiting dynamics {u{t))t£M is monotone for sufficiently small (i.e. negative t) so 
that it is uniquely determined by the first passage times in that range and hence everywhere. 

(ii) Alternatively we can use the fact that there is according to Lemma 13.61 up to a time shift 
a unique entrance law from at i — > — oo which is compatible with the growth conditions of a 
limit of that quantity as A'^ — > oo. Therefore by one first passage time this random time shift is 
determined and hence it suffices to prove convergence in law of one first passage time concluding 
the second argument. 

Having proved the reduction of the convergence statement to one on Tj^^mj, we have to show 
next that for one e G (0,1) and therefore by the above for every such e: 



(3.353) 




logiV 



a 



N 



with 



(3.354) T{e) = inf(t e M.\u{t) = e), 

where u is the (unique) entrance law with e"l*lu(t) — > W (recall Lemma [3^ . 
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Wc have two tasks, to identify the hmit on the r.h.s. of p.353p as the quantity defined in 
(|3.354p this however we showed in part f where we estabhshed in particular the convergence of 
the dynamic for the frequency of occupied sites, and secondly we have to show the convergence in 
(|3.353|) . So only the latter remains. 

We use our results on the corresponding times -r^(e), assuming no collisions occur, namely we 
see that it follows from Proposition 13.51 that {e) converges to a limit t(£) defined in terms of 
the collision-free process. Hence it remains to bridge from T^(e) to (e). 

To relate T^(e) and T^(e) we will use that we can use according to the above arguments 
arbitrarily small e > and we can work with the multicolour particle system which we introduced 
in the proof of Proposition 1 1 . 71 below p.l76|) . 

Consider e | 0. We have the fact from the multicolour particles system (recall p.l95p and 
(|3.197p ) that for sufficiently small e asymptotically as ^ oo in probability 

(3.355) f(e) <T^(e) <r(e), 
where e solves 

(3.356) e- Const e2 = e, £>0. 

Note that the Const is less than 1, so that there is a unique e> e such that 

(3.357) e-e==0(?^). 

In (|3.355p the right inequality holds for e small enough and N sufficiently large, while the l.h.s. 
holds even for every N. 

Denote by T^ij„(e) the normalised T^{e), i.e. {e)~a~^ log A^. We know that (e) — > T{e) 
as — > cx) if wc use the coupled collection of processes we constructed in the proof of equation 
p.312p . Hence for £o sufficiently small realizations of two different limit points T^i^-^{eo) , T^i^-^{eo) 
satisfy on a suitable probability space: 

(3.358) T{e) < C;.i,(e), ^^(e) < ?(?). 

Therefore \t^I^{s) — T^rm(£)l — I'^i^) ~ ^(^ll- We furthermore know that T{e) is given by (loge — 
logTy)/a so that the l.h.s. and the r.h.s. of p.358p differ by a deterministic quantity, which equals 
const • £0 as Eo 4- 0. As TV — >■ cx) we have a limiting evolution between T^(eo) and (s) which is 
deterministic. This evolution is a random shift of some deterministic curve. Then it is not possible 
to have two limit points for the law T^(e), since their difference are shifted versions of a given 
curve would translate into the same difference but now bounded above by arbitrarily small eg and 
therefore equal to 0. Hence all limit points are equal and Tjf^rm(e) = {t^ (e) — a~^\ogN) must 
converge as A'^ — > oo as claimed and we are done. This concludes the argument for the convergence 
of the marginals at time log A^ + to- 
Proof of ()3.314p We first need some preparation, involving the collision-free regime. Turn to 
the dual particle process and recall that at time log log A^ asymptotically as A'' — >■ oo there are 
VFlogA^ occupied sites and that at time a~^loglogA^ the empirical age and size distribution is 
given by the CMJ stable age and size distribution Z^((X), •, •), since we are in the non-collision regime 
due to loglogA^ << a~^logA^. Therefore at time riogA? the age distribution is (asymptotically 
as A^ — > oo) given by the stable age distribution. Moreover for each realization of W there is a 
time shift of the limiting dynamics which is a function of W. Since the evolution from this point 
T\ogN on is asymptotically deterministic we conclude that the age and size distribution at 
time \ogN ~ tpf + t, with t^ oo but tjv << ce~^ log A^, converges to a time shift of the one 
determined by equation p.248p and p.249p with initial condition given by u — and U given by 
the stable age distribution. We now have to bridge from times a'-^ log N — tN + t to log N + t, 
where collisions play a role. This runs as follows. 
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For a given approximation parameter e > wc choose a time horizon a^^ logiV + T(e) such 
that {K^ Xi^)t>o is, as — > oo, e-approximated by the colhsion-frec process {Kt,Ct)t>o up to 
time logiV + T(e) if wc choose e sufficiently small and this implies the approximation for the 
functionals C/^ by Z^/ as — > oo for e sufficiently small. More precisely we require that we can 
find a coupling between C/^ and U such that (|| • || denotes variational distance) 

(3.359) limsupPj sup \\U^ {t, ■ , ■) ~ U{t, ■ , ■)\\ > e] ^ as e ^ 0. 

In other words wc have to show that by the collisions up to time log N +T{e) at most difference 
e develops between and U for very large N. 

To show this note that we can bound the total effect of collisions occurring up to time log N+ 
T{e). Namely we return to the multicolour particle system defined below (|3.176p . which shows that 
we can relate the difference of the system without and with collision through the black and red 
particles. In order to show that \\U^{t, •, •) — U{t, •, •)|| < e it suffices to show that the proportion 
or red and black particles is o{e) in probability as — >■ oo. But this was verified for the number 
of black sites with T{e) = T(e) in (|3.195p . which immediately by using that C is stochastically 
bounded gives the claim for the number of particles. Since the number of descendants of red par- 
ticles grows at a slower rate as the black and new red founding particles occur at the same rate as 
black ones then the claim follows. This completes the proof of Part (c). 

Proof of (d): ()3.316p and ()3.317p 

Again we first turn to the collision-free process. We approximate (recall ZY refers to the collision- 
free CMJ- process) U{t, ■ , ■) hy U{(X) , ■ , ■) . By [N] (Theorem 6.3) we know that Z//(r, •, •) converges a.s. 
as r ^ +00 to the stable age and size distribution U{oo, ■,■) of the Crump-Mode- Jagers process 
(see (|3.164p ) and hence 

(3.360) U{a-^ \ogN + t, ■, •) — > U{oo, •, •), a.s.. 

Now take collisions into account and consider the limit of JJ^ -|-^, •, •) as iV — ^ oo, t — oo. 
Combining p.359p and p.360p it follows that for 77 > there exists Tij]) such that 

(3.361) limsuppf sup ||C/^(i^^ + t), •, •) - Z^(oo, •, OH > 2?/ ) < ?/. 

N^oo yt<T(i)) cy. j 

Therefore 

(3.362) lim U{t,-,-) = U{oo,-,-). 

i— f — 00 

Finally we have to verify that indeed (e~"* "J*'' )tgR converges to a limit (e~"*it(i))tgR such 
that e^"*it(i) — > W^'^ as i — > —00, where W^' arises as the growth constant of the collision- 
free process. This was already essentially done in the proof of p.31ip . if we observe that the 
difference between the collision- free process and {t) is bounded by the number of black sites, 
whose intensity vanishes as t — > — 00, since 

(3.363) lim sup P[T^(e) < a^'^logN + t] ^ as t ^ -00 

and hence the intensity of black sites is 0(£^) at time (e) for N large. 
This completes the proof of part (d). 

Part 3: Occupation density of dual particle system 
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Proof of ( e) 

Note first that (j3.319p follows from what wc proved for the collision-free regime, in particular 
the fact that sup{e~ Kt\t > 0) < oo. 

(|3.320p Next note that it suffices to prove the convergence in p.318p to obtain (|3.320p using 
p.319p . Namely we then have pointwise convergence on a suitable probability space and an upper 
bound is given by the collision-free process with single sites in equilibrium, which provides an 
intcgrable integrand and is independent of N and indeed using Proposition 13. 8[ part (b) we see 
that (|3.320p holds. 

(|3.318p Using Proposition 13.81 part (c) we see that for p.318p we have to reduce the problem 
concerning u^U^ and uU to what we showed there for and u respectively C/^ and U . Namely 
each of them converges weakly. However since 

(3.364) (u(0,t/(t, •)) — >u{t)U{t,-) 

is a continuous mapping this follows from the convergence properties we proved for {N~^u^ , U^) 
as — )• oo. 

This completes the proof of Proposition 13. 81 

Remark 28 Consider the normalized first passage times f^(e), where T^(e) is the first time 
where n^'*^'^ (rather than K^'^'^) reaches [eA^ J . 
Then one can show that for every e G (0, 1); 

(3.365) £[f^(e)_l^]=^£[f(e)], 
with 

(3.366) f(e) = inf (t e W\u{t)U{t, R, N) = e), 
and u is the entrance law with lim e"'*'u(<) = W . 

f oo 

3.2.9 Dual population in the transition regime: asymptotic expansion 

The purpose of this subsubsection is to use and extend the techniques we developed above to later 
be able to connect the early times (studied later in Subsection 13. 3p and the late times studied in 
Subsubsection 13.2.81 above. 

Return first to our original population model. On the side of the original process the limit in 
the first time scale, i.e. of times 0(1) as — > oo, is described by the branching process (II™)t>o 
and in the second time range, i.e. after times Q;~^logA^ by the deterministic McKean-Vlasov 
equation but with random initial condition. For both these limits we have derived (respectively 
for 3™ we will in Subsection 13. 3p exponential growth at the same rate a and for the first we had 
obtained the growth constant W* and for the latter the growth constant *>V. The objective below 
is to show that we can consider the limit simultaneously in both time scales in such a way that 
the final random value arising from the branching process describing the number of type-2 sites 
at small times provides the initial value of the McKean-Vlasov equation describing the evolution 
from emergence on till fixation meaning that in fact *W and W* have the same distribution. 

This section provides the analysis of the dual needed for this purpose. The completion of the 
argument is obtained in the following subsubsection which is devoted to the limiting branching 
process and the transfer back of properties of the dual to the original process. 

Recall the definition of u^, in p.235p - p.239p . In this section we deal with the relation of 
the two basic limits as iV oo of the functional of the dual population given by (u^ (t), (t)), 
namely, the limit in the two different time scales 

loe 

(3.367) tN = o(Tn) and tN ^Tn + t, where Tat := — and t € R, 

a 
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as well as the transition between these two time scales which separate in the Umit — > oo. 

The main tool developed in this subsection is the analysis of the dual population in this two-time 
scale context. For that purpose we make use of two techniques: (1) Nonlinear evolution equations 
for the dual particle systems and their enrichments, (2) couplings of particle systems to estimate 
and control the effects of collisions in the dual process by comparing it with collision-free systems, 
i.e. the dual of the McKean-Vlasov process. 

Heuristically we can approximate /N by which are the solutions of the ODE 

(3.368) ^ aj,{t)u^{t){l - ^-^u^{t)), t > to{N), 
at N 

(3.369) = 1 + S^(io(iV))=«'^(to(iV)) = iy^e"*«W, 

which allows to get some feeling for the behaviour based on explicit calculation for the case where 
aN and jn are constant and explicit formulas for the solution of the ODE (|3.368p and p.369|) can 
be given. 

There are two regimes depending on which of the time scales in (|3.367p is used: the first (linear) 
regime where N~^u'^ (tN) — > as iV — oo together with aN{tN) ol and jN{tN) -> 7 and the 
second (nonlinear) regime where u^(<Ar) = 0(N) and where aN{tN),jN{tN) differ from a, 7. 

We have proved in Subsubsection [3 . 2 . 31 that for times tN ^ —00 the number of sites occupied 
by the dual process at times + t^ and multiplied by converges to zero in probability. 
On the other hand we saw in Subsubsection 13.2.71 that assuming that in law 

3.370 -T^u^i^- + h) =^ u{to), 
then in law on path space 

(3.371) {^«^(^ + i)}*>*„ =^ {^(0}*>to 

and given the state at time t the limiting dual dynamics u{t) is deterministic and nonlinear, 
namely, {u{t),U{t))t>tg is for all G K the solution to the system p.248p . p.249p where in the 
latter quantity we integrate out the age. Moreover we proved that this system has a unique solution 
satisfying 

(3.372) lim e"l*lM(i) = W^, lim U{t)=U{oD), 

> — 00 > — 00 

where U{oo) is the stable size distribution of the CMJ process induced by the collection of the 
occupied sites of the McKean-Vlasov dual process. 

In order to later on relate *W and W* we next focus on *yV and try to get more information 
on its law. This requires, as we shall see later, when we return to the original process from the 
dual to obtain in (|3.372p the higher order terms as t — > —00 for the limiting equation (as N — > 00) 
as well as in the approximation as iV -^> 00 of this behaviour. This means we want to write for 
t —J- —00 the limiting (as TV 00) intensity of N~^u'^ , denoted u: 

(3.373) u{t) = We"' - Const ■ M^^gSat ^ ^(g2at-) 

and determine the constant, but moreover we want to consider for N — > 00, t — > —00 the quantity 

(3.374) u^(i^ +t) = WNit)e"* ■ N + C{t, N) 

a 

and to estimate the order in both N and t of the correction term C{t, N). The latter is equivalent 
to determining up to which order the expansion in p.373p is approximated by the finite A'^-system. 
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First wc turn to the question in p.374p and then to p.373|) for each point of view formulating a 
separate proposition. 

We therefore refine p.372p by providing speed of convergence, in t and uniformity, in N, results. 
Proposition 3.9 (Order of approximation of entrance law in N and t) 



(a) Let Tn = and < S < 1. Then {u^ {t),U^ {t)) and {u{t),U{t)) have the following 



We can couple the {u^ {t),U^ {t))t>o with a fixed CM, J process {Kt,U{t))t>o such that 
(3.375) e""*/ft — > W a.s. as t oo, 



(b) Given W the system \3.248^ , \3.249^ has unique solution {u{t),U{t)) with e~°'*u{t) -)■ W 
as t ^ — oo which can he realized together with {{u^ ,U^), N £ N} on one probability space such 
that the process 

(3.378) (e""*7V"iu^(TAr + i))tgR 

can for every S E (0, 1) be approximated by (e~"*u(t))tgR with "initial condition" W, up to order 
^aSt t £ R uniformly in N . □ 

We now have established a precise relation between {u^ ,U^) and the limiting entrance law 
{u,U). We now return to the solution {u,U) to (|3.248p . (|3.249p and a(i), ^[t) as defined in 
equation (|3.246p . The purpose of the rest of this section is to identify the asymptotic behaviour of 
this nonlinear system as t — > — oo up to higher order terms, i.e. we want to identify in particular 
the correction of order e^"* as i -> — oo which is due to the occurrence of collisions by migration 
of the dual individuals which by subsequent coalescence of the collided individuals can change the 
behaviour of the limiting system. 

Proposition 3.10 (Transition between linear and nonlinear regime limit dynamics {u,U)) 

(a) The pair {u,U) and the functionals a and 7 arising as the limit in ll3.312\) - ^3. S14^ via 
\3.246^ satisfy (here we suppress the k,£): 

(3.379) {a{t) + -f{t))u{t) <{a + j)We°'\ 

(3.380) lim a(i) = a, lim U{t)^U{oo), lim 7(i) = 7, 

(3.381) u{t) = M^e"* - nW^e^"^ + 0(e^"*) as t -00, for some constant k>Q 
and furthermore as t ^ —00 the functions a(-) and U{-) satisfy that 

(3.382) a(<) =a + 5oe"* + 0(e2"*), C/(t) Z^(oo) + t/oe"* + 0(6^"*), 

(3.383) 7(t) =7 + 7oe"* + 0(e2"*), 

where 5o is a positive number Uq is an N-vector and 70 = Uo{l)- 



three properties: 



(3.376) 




and 



(3.377) 
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Using a multicolor system construction, we will see below that all the randomness is given by 
the (non- degenerate positive) random variable W , namely 

(3.384) So is explicitly given by I13.569\) . as Consti ■ W, 

(3.385) Uq is explicitly given by Jg. 575^ and has the form W ■ const,^^ = Const2 • W. 

(b) The total number of particles satisfies 

(3.386) (a(t)+7(t))u(t) = (a + 7)W^e"*-«;*VF2e2"* + 0(e3"*) ast^-oo, 

for some constant k* > 0. 
More precisely with 

(3.387) kW"^ = (5o + io)W + kW'^ = W'^ {Consti + Const2 + k), 
we have that 

(3.388) limsup |(a(i) + -^{t))u{t) - (a + 7)W^e"* - K*W'^e^'^'\ ■ e'^"* < oo. 

f — oo 

(c) Furthermore U(t, ■, •) is uniformly continuous at t = —oo and as t ^ —oo 

(3.389) ||C/(i)-W(oo)-C/oe°*||var-0(e2"*). □ 

Remark 29 Note that the results above show that all the relevant randomness in the dual process 
sits indeed in the random variable W. 

The proof proceeds in four steps. The proof of Propositions 13.91 and 13.101 are based on an 
enriched version of the dual particle system 77 and a reformulation of the nonlinear system p.248p 
and (|3.249p which are given in Steps 1 and 2 below which arc then followed by the Steps 3 and 4 
giving the proof of the two propositions using these tools. In Step 1 we focus on the finite N system 
in a time regime where collisions due to migration become essential, while the Step 2 develops the 
analysis of the limiting system as TV 00 in this time regime. The main focus here is on the 
behaviour as the effect of collisions becomes small and to investigate its precise asymptotics in this 
regime. 

Step 1: Modified and enriched coloured particle system 

Our goal is to realize on one probability space the CMJ-process and the dual particle system 
in such a way that we can identify in a simple way the difference between the two dynamics 
with a higher degree of accuracy than before by using three colours white, black and red. In 
particular we can identify the part of the population involved in one or in more than one collision 
(more than two etc.). This joint probability space can be explicitly constructed easily, based on 
collections of Poisson processes. Since no measure theoretic subtleties occur, we do not write out 
this construction in all its lengthy detail and just spell out the evolution rules. 

In order to identify the contributions at time a~^logN + t oi order e"*, e^"*, e^"*, we want 
to expand the number of occupied sites at time logiV + i, using quantities WN{t) defined as 
g-atj^^N g^j-^j-j then analog higher-order objects, in the form: 

(3.390) {Tn + t)^WN (Oe"* - W'jy (i)e'"* + W^:; (^e'"* , 

with Tn = a^MogiV and N^'^WNit), N-'^W'j^{t), N'^W^it) converging as TV oo,t ^ -00, 
more precisely, 

limt^_oolimw^oo[e-"*Ar-iuAr(rAr + t)] = W 

(3.391) limt^_oo limw^oo[e~2"*iV-2[(M/iVe"* - u^(Tjv + t))]i] = kW^ , 
limt^_oolimAr^oo[e3"*A^~^[uw(7Ar + <)) - WNe"^ - nN^W^e^"*]] < 00, 
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where the limits are needed in probability and in L^. The latter convergence follows from the 
convergence in probability and the finiteness of the moments of W (see Lemma 13. 4p . 

We use the dual particle system to obtain information on the asymptotics as t ^ ^oo of the 
pair (u, U) and also to identify their random initial condition. However to carry this out we must 
enrich the dual particle system to a multicolour particle system on an enriched geographic space. 

Since we want to read off more properties than in the emergence argument based on the mean 
where we had white, black and red particles we shall need now some more colours which allow us 
to record particles which have been involved in one, two, etc. collisions and with associated colours 
their counterpart in a collision-free system. 

Furthermore in order to realize the CMJ and the dual on one probability space we have to 
couple the evolution of certain colours where one belongs to the CMJ-part and the other to the 
dual particle system part. This will also require us to refine the geographic space {1, • • • , N} + N 
(where the black particles were collision-free and moving on N) we had before for the white, red, 
black particles system by adding a further copy N (or more if we need more accuracy). 

More precisely we now introduce a multicolour particle system that is obtained by modifying 
and enriching the coloured particle system defined by p.l77p - p.l82p by introducing (1) new 
colours and (2) new type of sites. The number of new colours and sites necessary depends on how 
precisely we want to control the behaviour as < — > — oo meaning up to which higher order terms as 
t — > — oo we want to go. 

We first need a new colour (green) which allows us to control exactly (i.e. not just estimating 
from above) the difference between the dual particle system rj and our old white-black collision-free 
particle system. Recall that in the old system we created a red-black pair upon a collision such 
that the further evolution of black remains collision-free and the red follows the true mechanism of 
the dual particle system. A real difference in the total number of particles between dual and CMJ 
will however occur only if the red particle coalesces with a white particle but no such loss of an 
individual occurs in the CMJ-process which now will have one particle more due to this transition. 
To mark this we will use the colour green to mark the dissappearing red particle in the dual. 

To allow a fine comparison between the white-red and the white-black system we will use 
coupling techniques which allow us to estimate not only the effect of collisions (and subsequent 
coalescents) but also recollisions, the latter will generate parallel to the red-black construction two 
new colours purple (in the dual system) and blue (in the collision- free comparison system) . For this 
purpose we have to modify the evolution rules when red particles collide or red particles collide 
with white particles. For the bookkeeping of the effects from these events we use the new further 
colours. 

The white particles in the old process also have the same dynamics as the white particles in 
our new formulation but in addition to white, red, and black particles the green, purple and blue 
particles can have now locations in 

(3.392) {1,2,---,A^} + N + N, 

which we will explain as we describe the evolution rules. 

In order to achieve all these goals of a higher order expansion the modified dynamics should 
satisfy the following requirements: 

• for each N the union of black plus white particles is equivalent to the particle system without 
collisions and can be defined on a common probability space by a single CMJ process, 

• the white plus red plus purple particles give a version of (u^ (t), {t, •, •)), 

• the red plus purple particles give a refinement of the red particle system in the white-black- 
red process used earlier, i.e. both together have the same dynamics as the set of red particles 
in ([3X6]) . 
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• the green particles describe the loss due to coalescence after collision, i.e. after a particle 
(white, red or purple) migrated to an occupied site on {1, • • • , N} (by a white, red or purple 
particles) and then coalesced with such particle at this site, 

• the black particles are placed and evolve on the second copy of N and blue particles are 
placed and evolve on the first copy of N. 

• We can match certain pairs of red and black respectively purple and blue particles when they 
are created to better compare the CMJ-part and the one with the dual dynamics. 

• The number of the sites occupied by red, green and blue particles has the same law as the 
number of sites occupied by black particles. 

For each the evolution rules below define a Markov pure jump process which describes a 
growing population of individuals carrying a location and a colour and in addition the information 
which pairs of individuals arc coupled. 

Let /„ be a copy of {1, • • • , n}. Denote by 

(3.393) C = { white, black, red, purple, blue, green}, 

Then the state space is given by the union over n. of the set of maps 



If we just count the number particles with a certain colour and location we get again a Markov 
pure jump process, specifying a multitype particle system on {1,2,..., N} + N + N. The state 
space of this system is given by: 



We shall work with both processes. 

In order to satisfy the properties specified above, the dynamics of the enriched coloured particle 
systems has the following dynamics: 

• Given that k is the number of sites in {1,2,..., TV} which are occupied, and a white particle 
migrates to another site in {1, • • • , TV} the outcome is given as in (|3.179p . p.lSOp and p.lSip 
(a red-black pair is created) but we record that these particles form a red-black pair. White 
particles give birth to white particles and two white particles coalesce to produce one white 
particle. 

• Black particles evolve as before but live on the second copy of N. 

• Red particles give birth to red particles, two red particles at the same site coalesce to one 
red particle. 

• When a red particle coalesces with a white particle (at the same site) the outcome is a white 
particle and a green particle at this site. Moreover we then couple the green particle with 
the black particle that had been associated with the involved red particle. 

• Given that the current number of sites occupied by white, red, purple or green particles on 
{1, • • • , A^} is £, when a red particle migrates it moves to an empty site in {1, . . . , N} with 
probability 1 — and with probability it dies and produces a blue-purple pair where the 
blue particle is placed at the first empty site in in the first copy of N and the purple particle 
is placed at a randomly chosen occupied site on {1, 2, • • • , N}. We also associate the blue 
particle with the black particle that had been associated with the involved red particle. 



(3.394) /, 



n 



Cx ({l,...,iV} + N + N})x (/2). 



(3.395) (No)(«i'2,...,7V}+N+N}xC)^ 
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• Blue particles have the same dynamics as black particles. 

• Purple particles have the same dynamics as the dual particle systems ij on {!,... ,N}, in 
particular they migrate to a randomly chosen site in {1, . . . , N}. 

• Green particles give birth to green particles and two green particles at one site coalesce giving 
one green particle. Green particles on {1, 2, • • • , N} migrate to empty sites on {1, • • • , A''} with 
probability i — jf and with probability to the first empty site in the first copy of N where £ 
is the number of occupied sites on {1, • • • , N}. Green particles on the first copy of N migrate 
to the first empty site. 

• When a green and red particle coalesce they produce a just red particle. 

Remark 30 The reason for this rule is as follows. The red and green pair correspond to two 
red particles one of which has coalesced with a white particle. The green particle corresponds 
to a particle loss due to collision and coalescence. First note that the green particle which 
coalesces with a red one must have been created at that site. Hence both red and green must 
descend from the same collision of a particle arriving at the site and hence the coupled black 
particles share the same site. The red and green particle are coupled with a black or blue 
particle each. The two black particles can coalesce only if they descend from the same black 
particle after arriving at the current site. In this case the number of red and green, respectively 
black and blue is reduced both by one and the original loss of black-white versus white-red is 
cancelled. When the green and red coalesce this corresponds to the coalescence of two red 
particles that would have been at a site having no white particles. Then the coalescence 
reduces the number of red particles to one. This means that the potential loss is canceled out 
and in both cases we end up with a single red particle. 

Green and white particles do not coalesce. 

• We couple for a newly created red-black pair the birth, coalescence with offspring up to the 
time of the first collision of a red particle with a site occupied by red or white particles. We 
then continue by coupling for all future times the associated black particle with the blue 
particle (from the bluc-purplc pair) created at the collision time. 

• When red and purple particles coalesce they produce a red particle. 

• When a purple particle coalesces with a white particle the outcome is a white and a green 
particle at the site. 

Note that these rules lead to a system with the six desired properties we had listed below (|3.392|) . 
The state space of this system is given by: 



We shall single out specific subsystems comprised of particles of certain colours, for example, 
the WRP-system (white, red, purple) or the WRGB-system (white, red, green, blue) which have 
state-spaces: 



(3.396) ({{l,2,...,Ar}-FN + N} X C)^«. 



(3.397) 



.({{1,2 7V}+N+N}x{W,ii;,P,}) 





[({{l,2,...,7V}+N+N}x{VF,ii;,G,B}) 

'o 



respectively. If we need more information, then we have the analogue of (|3.394p . 

Observe that white, red, and purple particles are located only in {!,..., N}, blue particles are 
located only in the first copy of N and green particles can be in either {!,..., N} or first copy of N 
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and all black particles live in the second copy of N. It is important to note that the transitions of 
green particles do not depend on whether they are located in {1, • • • , N} or in the first copy of N. 

We note that the difference between the number of red plus purple particles and the number 
of black particles is due to the coalescence of red or purple particles with white particles. Each 
such loss is compensated by the creation of a green particle and we note that green particles never 
migrate to an occupied site. 

By the construction above we define on a common probability space five coztp/erf particle systems, 

• a CMJ-proccss which is the union of black and white particles which give a version of 
(ift,C/(0)t>o, 

• the white, red and purple particles generate a version of U'^ (t)), 

• a CMJ process which is the union of white, red, green and blue particles and which generates 
a version of the pair {Kt, U{t))t>o, 

• a subset of this above system, the white system generating the pair {K^s^ ^,Ul^,^{t, ■)), 
a CMJ-process which is the union of black and white particles which give a version of 

• a coupled subsystem, on the one hand the system of red and purple and on the other hand 
a subset of the black particles. This coupling is induced by the convention how purple-blue 
and red-black pairs evolve. 

As before in the analysis of (u, U) it suffices for our purposes to study a functional of the 
multicolour particle system by observing at sites only how many particles of the various colours 
occur and counting the number of sites of a specific colour configuration. Our functional of a state 
is given by the counting process 

(3.398) *f (iM/,ifl;,ip,iG, isjisL), 

denoting the number of sites containing iyy white, ir red, ip purple, ic green, is blue and isL 
black particles at time t. 

The process (^'f )(>o evolves itself as a pure jump-strong Markov process with state space the 
counting measures on , the set of colour configurations, due to the fact that the dynamic only 
depends on the vector of the numbers of particles of the various colours at a site. Similarly we 
define pure Markov jump processes 

(3.399) ^^t{iw), 'i>wRp,ti'i'W,iR,ip), '^wRGB,Sw,iR,iG,iB)- 

Next we want to define the coloured versions of the {u, U) system and of certain subsystems. 
We therefore define the number of sites exhibiting certain colours: 

(3.400) /v t = ^ ljiv+iBt>i*f (*iv,i_R,ip, icis, isi), 

(3.401) K^j^p^= ^ li^^,+ij^+ij,>i^!^j^pj.{iw,iB.,ip), 

(3.402) K^RGB,t= X! ^iw+iR+iG+iB>l'^WRGBAiw,iR,iG,iB) 

iiv ArAg .is 

and occasionally we make use of 

(3.403) K^^t^ 1 E «.>i*c.t(*c), CC{W,R,P,G,B,BL}, ic^{iL)Lec- 
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Now wc can define the coloured relatives of (m^, U^) due to our construction all on one prob- 
ability space, setting 

(3.404) {u{t),u^{t),u^j,p{t),u^j,as{t)) = (A^t,A'^,t,if^^p,t,if^«GB,t) = 

TtN (a \ _ l'w.t('iv) 



TjN (■ ■ ■ \ _ 1'wHGB,t('W.ifl,iG,»i3) 



Lemma 3.11 (Properties of enriched coloured particle system) 

(a) Ignoring colour the process {uwFi.PtT^WB.pi^T '))t>0 has the same law as {{u^ ,U^))t>o 
given by LS.235]) - k3.2S9\) . 

(b) Ignoring colour, the the total number of particles or occupied sites in the WRGB-system has 
the same distribution as the number of particles respectively occupied sites in W-BL-system and 
therefore we can identify the total number of occupied sites with the total number of individuals of 
a CM J process which we denote by {Kt)t>o- 

(c) The total number of particles (or sites) in the WRP-system is less than that in the WRB- 
system and larger that that in the WR-system a.s. 

(d) Consider the coloured particle system at times 

(3.406) Tat + t, Tat = log N. 

The total number of purple or blue particles is 0(e'^"*), more precisely, we have uniformly in N , 

(3.407) *^-<iiP,T„ 

iw -ia-ip 

< ^ is ■'i'wRGB,T„+ti'iw,iR,iG,iB) < const ■ NW^e^"'^, 

where 

(3.408) = sup(e-"*A't) < oo, a.s. . 

t 

(e) We consider again a time horizon as in (d). Define WN{t) = N~^e''"^u'^ {T^ +t). The 
process counting the number of red particles produced by collisions of white particles is bounded 
above by an inhomogeneous Poisson process with rate 

(3.409) Ne^"*{W^{t)f 
where 

(3.410) lim WNit) = W for each teR. □ 

A^— >-oo 

Corollary 3.12 The normalized (by N^^ ) number of red respectively purple particles at time Tj^+t 
conditioned on (W, VK*) is uniformly in N 

(3.411) 0(6^"*), respectively 0{e^"*), ast^-oo. □ 
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Remark 31 By introducing further colours, for example, following the first collision of a purple 
particle with a site occupied by some other colour introducing a new pair of colours analoguous to 
purple and blue, etc., we could obtain upper and lower bounds with errors of order 0{e^°'*) for any 

keN. 

Proof of Lemma 13.111 (a) follows by observing that combined number of white, red and purple 
particles at a site behaves exactly like the dual particle process of typical site and when they migrate 
they can have collisions according to the same rules as the dual particle system. This follows since 
when a white particle migrates it collides with the correct probability and then becomes red and 
when a red particle migrates it can collide and become purple and when a purple particle migrates 
it can collide again following the same rule (and remains purple). 

(b) Since the green particles compensate for the loss of red particles due to coalescence after 
collision of a white with another white particle and the blue particles arise upon collision of red 
with red or white and they evolve as the black particles, the total number of particles is as in the 
W-BL system. 

(c) follows since (i) the blue particles are produced in one to one correspondence with the purple 
particles and (ii) the purple particles can suffer loss due to collision and coalescence but this does 
not occur with the blue particles. 

(d) , (e) We have proved earlier in this section using the CMJ-theory that the normalized rate 
for the number of white particles 

(3.412) lu^{TN + t) = Oie"'), 

if we condition on {W^ W^*} since for all N this number is smaller than the number of the white and 
black particles together, which is then asymptotically Vl/Ar(t)iVe"*, with W^it) — > as iV — > oo 
by the CMJ-theory. When white particles migrate they collide with another white particle with 
probability Uy^{Ttq + t)/N. Moreover white migrants are at times before Tm + t produced at rate 
at most WM{t) Nae"*. Therefore the normalized rate for the number of red particles produced 
conditioned on {W^, M^*} is at time + t is satisfying 0{Ne'^"*). 

Finally a purple or blue particle occurs when a red particle collides with a white or red and 
therefore the rate of this event is of order 

(3.413) 0(7Ve2"*) X 0(6"*) = ©(TVe^"*), 

if we condition again on {W, W*}. 

This completes the proof of Lemma 13.111 which will be a key tool in the further analysis of the 
dual process. 

Remark 32 Since the WRGB system can be identified with the W-BL(by assigning the union of 
the red and green particles at a white site to an empty site in the first copy ofN)a system we will 
primarily work with the former and ignore the BL system. We then have both upper and lower 
bounds for the WRP system given by the WRB, WR systems, respectively and this will be our 
primary object for the analysis of the t — > — oo asymptotics. This will provide upper and lower 
bounds with error of 0{e'^°'^) and therefore determine the second order asymptotics as t — > — oo. 

Step 2: Reformulation of the nonlinear (m, [/), (it^pj, [/(p)) equations 
This step has two parts, first rewriting the (u, U) equation and then a second part where we 
do this for the multicolour version of this equation. 
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Part 1 Wc will start by bringing the equations of the limit dynamic (u, U) in a form suitable 
for the purpose of the proof of Propositions 13.91 and 13.101 Recall that {u, U) solves the following 
system of differential equations in the Banaeh space (M x Li{l^,v)^ \\ ■ \\) (remember for the first 
line that J7 is a normalized quantity): 

(3.414) '^^am~<t))u{t)-^{t)u\t), t>to, 



(3.415) = +,s{j - l)lj^iU{t,j - 1) - sjU{t,j) 

+ ^{J + l)jU{t,j + 1) - - m,^,Uit,j) 
+c(j + l)U{t,j + 1) - cjU{t,j)l,^i 

+u{t){a{t) + i{t))[U{t,] - 1)1,^1 - U{t,j)]) 
~uit) (ait) + -fit)) 1,^1 

-(a{t){l-u{t))-j{t)u{t)yU{t,j). 

Remark 33 We want to interpret these evolution equations ^3.414^ , ^3.415^ by a particle system of 
the type of the mean-field dual but now making more explicit the role of collision and in particular 
the first collisions of particles. This will allow us to analyse the behaviour as t ^ — oo of the 
nonlinear evolution equation above. The analysis will be based on the fact that u{t) arises as the 
limit of N^^u^ [Tjs! -\- 1) and similarly U{t) as the limit ofU^{TN + t) where T^ = a^^\ogN. 
This will allow us in the second part of this Step 2 to introduce enrichments of the solution of 
the nonlinear equations through coloured particle systems. Then by taking the N oo limit of the 
normalized coloured particle systems we obtain a coloured limiting evolution and by that information 
about the nonlinear original system. 

We next rewrite the equation p.415p in a form suitable for the analysis of the solution viewed 
as a perturbation of the linear (collision-free and hence u = 0, a{t) = a,^{t) = 7) equation in 
the limit as t — cx). We shall see below in Lemma [3.141 that with u{t) = 0, the system p.415p 
has the stable age distribution (00, M, •) of the CMJ as equilibrium and a — c^^2 kU{oo,R, k). 
Hence we should organize the r.h.s. of p.415|) in such a way that we isolate the linear part on the 
one hand and the nonlinear perturbations of this linear part on the other hand. 

We define for every parameter a € (0, 00) (for which we shall later choose the value a) the triple 
of N X N-matrices 

(3.416) (Qg, gi,L), 
by the equations: 

(3.417) Qo ^ Qo - al + alj^i = {qj,k)j^u&i (= (%,fc(a))j,fceN) , 
where 

qi2 = s, qi,i = -s 

52,3 = 2.S, q2j=d + 2c + a, 

13,3+1 =sj, 'Jjj-i = 1) +cj, q]i=a, j^l,2 

<lj] = --V - - ^jU - 1) - a, 3 1, 
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Qi = fefc)j,/ceN, where 

(3.418) qj, = -1, 

Qjk =0 j ^ k, 

and finally 

(3.419) L = {Ijk), ijj = 0, and for j ^ 1, = 1. 

Note that the matrix is for the forward equation but we consider Qq for the backward 
equation. Then for each a > 0: 

(3.420) generates a semigroup (S'°(t))t>o on Loo(N) 

corresponding to a unique (pure jump) Markov process on N. This process is as follows. The 
matrix Qq defines a Markov process on N. For a = we obtain the birth and death process which 
corresponds to a colony where we have birth rate s for each particle, coalescence of two particles 
at rate d and emigration of a particle at rate c, except when there is only one particle. For positive 
a the process is put at rate a in the state with only one particle. 
Furthermore we abbreviate 

oo 

(3.421) 5(t) = c^A:C/(t,fc) - a. 

k=1 

With these four ingredients equation (|3.415p finally reads: 
Lemma 3.13 (Rewritten U -equation) 

(3.422) ^= Uit)Q^ 

+Sit)[U{t)Qi + lj=i] + uit){a{t) + j{t))[Uit)L - lj=i]. □ 

Remark 34 Note that our equations for (u, U) are forward equations and hence all the operators 
Qq,Qi,L act "from the right" on U(t). 

Remark 35 Note that we can consider the equation \3.^2^ also for arbitrary values a g (0,oo). 
Denote the solutions as 

(3.423) ([/"(O)tGE. 

We shall see below how we can characterize U among those by a self- consistency property. 

We can later use the following information about the semigroup S*" to characterize the growth 
rate a of {u{t))t G K in the entrance law {u, U) as t ^ — oo. 

Lemma 3.14 (Representation of a) 

Consider the evolution equation for U°'{t) in the regime in which (u(t))tgR is identically zero 
and a{t) = a. Then for given a > there is a unique equilibrium state (positive eigenvector) 

(3.424) {g*(a)}jgN for the operator (5q( equalling Qq — a/ + alj=i). 

Then a is uniquely determined as the fixed point defined by the self- consistency equation 

oo 

(3.425) a = c^jg*(a). 
Also, 

(3.426) {q*{a),j = 1, 2, • • •} = {Z^(oo, M, j), J = 1, 2, • • •} = (pi,p2, • ■ •), 
the r.h.s. being the stable size distribution of the CM J process. □ 
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Proof The existence of the unique equilibrium qj{a) follows from standard Markov chain theory, 
the question is whether the self-consistency equation p.425p has a solution. Note for this purpose 
first that qj{0) is the equilibrium for the single site birth and death process appearing in the 
McKean-Vlasov dual process for one component and observe that ^JL2jlji^) > 0- We define: 

oo 

(3.427) ^^:a— >c^jg;(a). 

i=2 

Then we are left to show that the fixed point equation a = F{a) has a solution. We saw above 
-F(O) > 0. Wc claim next that the function is monotone decreasing in a, converging to as a — > oo 
and continuous. 

To verify the monotonicity, let 02 > ai > 0. Recall that the chain for a = starting from 1 
is stochastically increasing to its equilibrium. Now consider the two Markov chains for ai , 02 as 
a sequence of independent excursions away from 1 which end when a jump to 1 at rate a occurs 
but otherwise follow the Qg dynamic. To compare the average height over the excursions, we 
consider two such excursion lengths given by coupled exponentials (a2)~^5, (ai)~^£, respectively. 
Then we observe by a simple coupling argument that the heights of the oi-excursion at times 
(02)"^^ < t < (ai)~^£ stochastically dominate the height at time (02)^^^. Therefore the average 
height over an excursion for the ai-chain is stochastically greater than or equal to that for the 
a2-chain. 

The continuity follows by noting that the a2-excursions from zero converge to the ai excursion 
from if 02 4, fli. 

Noting finally that '^'^2 jlj i*^) converges to when a — > 00, we obtain the existence and 
uniqueness of a solution of (j3.425p which we read as a fixed point of the equation a = F{a). 

Part 2. We now turn to the second part of Step 2, where we give the limiting dynamics of 
the multicolour enrichment as represented by {'U'^wRPGB)^^(AVRPGB)y need systems of 

other subsets of colours in our arguments and therefore we denote the limiting objects analog to 
the noncoloured system by 

(3.428) (u(c),C/(c)), 
with 

(3.429) C = {W, R, P, G, B} or some other colour subset of {W, R, P, G, BL, B}, 
where uq is a positive real number and ?7(c) is a measure on 

(3.430) (No)^. 

This set-up means that we consider the processes of occupied sites, occupied with the various 
colour combinations. The dynamics, in the TV — >■ 00 limit, of the W RP-systcm (or the WRGB- 
system) at time Tn + t is given by an enrichment (u(H'i?,P) ('•)i t^(VF-RP)(0) c>f the nonlinear system 
p.414p . I|3.415p . Namely the latter is recovered as follows: 

(3.431) u{t) ^ U(^iYiip){t), U{t,k)= ^ U(^wRP}it;^w,iR,ip)- 

{{iw,iR,ip)-iw+iR+ip=k} 

In order to write down the evolution equation for the quantities from (|3.427p . we need the 
changes occuring in the underlying multicolour system and its various subsystems which then 
induce the changes in the measure on these configurations. For this purpose we need operators 
associated with the various possible transitions, their rate and their form which are associated with 
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a particular current state. This state is given by a tuple of the form (iw^'i-R^'i-PiiBTiGyiBL) or a 
tuple for a smaller set of colours which gives the number of particles of the various colours at a 
site. Therefore transitions and transition rates are specified by matrices of the form 

(3.432) , withi,j e (No)*^ , C Q {W, R, P, B,G, BL}. 

We can now distinguish two different groups of transitions those which concern only particles of 
one colour and then there are transitions where particles of different colours interact. We specify 
these transitions and the corresponding operators in (|3.433p and then in p.435p . p.437p . 

For the first type the one colour operators let 

(3.433) Q"'* = operator Qq applied to the ith colour occupation numbers - see (|3.417|) 

corresponding to birth, coalescence and emigration 

(as long as there are more than one particle) of the ith colour individuals. 

Next we introduce the appropriate inter-type coalescence operators (for versions with formulas 
for the r.h.s., see p.438p and sequel below) which describe the changes in the limiting frequency 
measure U based on changes in the occupation numbers which correspond to actions of the coloured 
particles. There are essentially two types of intertype-transitions, (1) changes which occur at a site 
with rates depending on the state at this site and (2) the creation of a new site at rates depending 
on the state at the founder site leading to the migration operators. 

The two groups of matrices corresponding to the coalescence, respectively migration operator, 
are the following matrices 

(3.434) = (Qfj), iJeN^ with ACCxCoTAC{a^ b\a, 6 G C} 

running through the list of names of the various transitions are, beginning with the coalescence 
operators: 

= coalescence of white and red at same site 
yielding a white and green pair at this site, 
~ coalescence of white and purple at same site 
yielding a white and blue pair at this site, 
= coalescence of red and purple at same site 
yielding a red at this site, 
= coalescence of green and red at same site 
yielding a red at this site 
~ coalescence of purple and blue at same site 
yielding a purple at this site, 
= Q^^ = 0. 

and the migration operators, which are effectively creation operators for new occupied sites, which 
in particular are then sites occupied by initially only one particle. For sites occupied by only one 
colour we can talk of green (G), blue (B), purple (P), red (R), white (W) sites without ambiguity. 
However for multicolours, a WR site denotes a site having at least one white or one red particle, 
etc. 



(3.435) Q 



WR 



(3.436) 



-tWG 



= creation of white site at an empty site in {1, . . . , A'^}, 
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Q ~ creation of red site at randomly chosen occupied site in {1, . . . , N}, 

qR^R — creation of red at empty site in {1, . . . , N}, 
^R^p^B _ Qj.Qa^;jQ]2 of purple-blue pair 

with purple at randomly chosen occupied site in {1, . . . , N} 
and blue at the first empty site in the first copy of N, 
Q^^^ — creation of purple site at randomly chosen site in {1, . . . , N}, 
Q'~^^^ = creation of green site at first empty site in the first copy of N, 
Q^^^ = creation of blue site at first empty site in the first copy of N. 



The verbal description on the r.h.s above corresponds to the following expressions which we 
spell out in two examples: 

(3.438) (1,0,0,0) = c • iiy, 

(3.439) Q^,:^f^,^a,^B).i^'„,^'^ + la'aa'^) = ' ^w +iR ^1 ^wUw RGsit, i'w , ^R, ^ ^'b) ^ etc.. 



We now develop in the detail the system of equations for UwRGsit) since the former provides 
the necessary upper and lower bounds. 

The pair (w, U) arises as the limit as oo of the particle system (u^, U^) in times 

logiV 

(3.440) Tat + t where Tat := 

a 

The proofs are based on first taking the limit as TV — oo of the (^*(M/i?,GB)' ^(WiiGB)) system 
and then identifying the order of the terms corresponding to the different colours in the limit as 
< — > — oo based on the structure of the particle system. 

Now we set for the rescaled processes (misusing earlier -notation) 



(3.441) 



(3.442) 



IwRGsit) = N ^U^RGsiTN + t), 

^WRGsi^^^w.iR.iG.iB) = U^rgb( 

iwRpit)=N-'u^npiTN + t), 
^WRpi^^ ^M^' *-R' *-P) = ^wrpC^n + t, iw, ip, 



Then let TV — >■ oo and we again obtain, the existence of the limit of the rescaled system {u^ , U^), 
the existence of the coloured versions. The limiting objects are denoted: 

(3.443) {uwRGB{t),UwRGB{t, ■,-,■)), {uwRp{t),UwRp{t, ■,-,■)), 

which will satisfy a system of equations which we give below in p.454p , and (|3.455p respectively in 
p.460p . We omit the details of the convergence proof here, which are straightforward modifications 
of the argument given in the proof of Proposition 13. 



Remark 36 In order to give an intuitive picture, even after taking the limit N oo, we shall 
still refer to this nonlinear system in terms of coloured particles, but of course the quantities 
in question are now continuous quantities that arise in the limit as N oo of the normalized 
quantities corresponding the the well defined particle system given by the above pure jump process. 



Note first the following facts about the limiting evolution which are important for our purposes 
and which follow from the corresponding finite- properties: 
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• since the green particles compensate for the loss of any red particles due to coalescence, the 
number of red plus green particles at a given site has an evolution which is independent of 
the white particles (but not the distribution of the relative proportions of red and green), 

• the red plus green plus blue particle system has the same dynamics as the black particle 
system and since red and black founding particles are created at the same time both systems 
have the same distribution, 

• the particle system comprised of white, red and purple particles has the same distribution as 
the dual particle system. 

Recalling that the WBL-system is a CMJ-system and the RGB process can be identified with 
the set of black particles, it follows that {K^~^{t) + -f^(^GB)(*))t>o ^^^^ than a CMJ process 
with Malthusian parameter a but the difference is of order o(e"*) and we have for i e M, that the 
intensity of the occupied sites in the various coloured systems satisfies: 



(3.444) UwR.(t) < UwB^pit) < UwRB{t) < "WRGB 



< uw{t) + URGBit) = lim 

N^oo 



^iu^iTN+t) + u^aBiTN+t)) 



The intensity of individuals in the various coloured systems satisfy that 

(3.445) ^ {iw + iR)uwRGB{t)UwRGB{i,iWi^Ri^G,iB) 

< X] {'^w + iR + ip)uwRp{t)UwRp{t,iw,iR,ip) 

< ^ {iw +^R + ^B)uwRGBit)UwRBG{t,^W,^R,^G,^B)■ 
iw ,iR-iG,iB 

As a result of these inequalities we note that the WRGB system provides lower and upper 
bounds for the "number" of occupied sites and "total number of particles" for the WRP system 
which corresponds to the dual particle system of interest. We note that the difference between the 
upper and lower bounds for the total number of particles (which corresponds to the number of 
blue particles) is of order 0(e^"*). Since each occupied site must contain at least one particle this 
provides bounds for the number of occupied sites with the same order of accuracy. For this reason 
we focus on the WRGB system and then read off the required estimates for the WRP system up 
to this order of a accuracy. 

We now write down the equations for the WRGB-system. i.e. for the corresponding pair (w, U). 
We need the following abbreviations where all sums are over iw, ip, iG a-nd ib'- 



(3.446) U(M/K)(t) ^ U(^wRGB)it) ■ (^Uvv+ij^>iU{t;iw,iR,iG,iB)^ 

(3.447) U(^wRB)it) =U{wRGB){t) ■ {^'^i».+in+iB>lU{t\iw,iR,iG,iB)^ 

(3.448) aw{t) = c'^iw ■ Uw+iR+iG>2U{wRGB){t;iw ,iR,iGiiB), 

(3.449) aR{t) = j_R • Ua+hv+iG>2U{wRGB){t\T'W ,iRiiG,iB), 



(3.450) aG{t) = c'Y^iG ■ Ug>2 U(wRGB){i]iw,iR,iG,T'B), 



3 PROOF OF RESULTS 



103 



(3.451) asit) = c^iB • Ub>2 U(wRGB){tiiw,iR,iG,iB), 

(3.452) ^wit) = clijy+i^+ic=ili„,=i t/(n/HGB)(i; 1,0,0, 0), similarly for 7flXi),7G(i), 7s 

Then we can write down the equation for (u(c), U(c)) as coloured version of the (u, J7)-equation. 
Again as in the latter case we work in the very same Banach space (M x ii(N, i^), || • ||) without 
mentioning this explicitly below. 

Consider the WRGB-system described by the 

(3.453) {u(^wR.GB){t), U(wRGB){t, ■)),te R) 
satisfying the following nonlinear equation: 

(3.454) — = (aw' -> aw(0 + "ii"H(i))w(WRGB)(i)(l - W(WR)(i)) 



-QG aG(i)lt(WRGB)(0 +aB ^ aB(i)-U(WRGB)(i) 
-{iW iwit) + 7-R ^ 7i?.(^))w(WR)(0'«(WRGB)(0- 



(3.455) ^^^^^^|P^ = Wb)(OQ™W, 



where 



(3.456) Q(w«GB)(^)^ ^ qo.^q(wr)^qRG 

i=WM,G,B 

-a(t)/, 

with initial condition at time t = — oo given by 

(3.457) lim W(wRGB)(i) = 0, 



(3.458) lim Uwrgb {t,iw,iR,iG,'iB) =Uao{iw) iiiR = iG = iB=0 

= 0, otherwise, 

where Uoo is the stable size distribution of the CMJ process. 

If we write for a better understanding of p.455p this equation out pointwise, we obtain (we 
suppress on the r.h.s. the subscript WRGB in U): 



(3.459) 



C^t^(WRGB) ,^ . .... 
Q-^ [t; IW, IR, «G, IB) = 

+ + '^)Uit;iw,iR + 1, - 1,«b) - ^iwiRU{t;iw,iR,iG - l,«s) 

+ ^{iw +«i?)(«G + ^)U{t;iw,iR,iG + 1,«b) 

+ iR)iG^iR^iU{t;iw,iR + l,iG ~ 1,*b) 
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+ aw{t){l - U^WR}{t))l{^y^^^,,^^o.^a=o.^^=o) 
+ aR{t){l - W)l(»vv=o..„ = i,.G=o..i,=o) 

+ {awit) + lw{t))ui^wR){'t){U{t\iw,iR + l,iG - 1,«b) - U{t;w ,iR + ^,'iG~ 
+ {aG{t)+ ^ C/(i;«H',«fl + 1,«G - l,«s))l(o, 0,1,0) 

i+J>l 

+ {aBit) + ^jU{t;iw,iR + 1,«g - 1, *B)?i(wfl) (i))l(o,o,o,i) 

- (a(t)(l - uvvflX*)) - 7(i)MH'i?X*)) • U(^wRGB){t;'iw,'iR + 1,«G - 1,«b)- 

The process (u(H'i?p), [^(W-RP)) satisfies for M(TyfiP) ~ " our original equation, and the quantity 
U{wRP) (t) satisfies a similar set of equations as above with B replaced by P but in this case we 
must add the terms corresponding to the migration of a purple particle to a site occupied by white, 
red or purple. Also when a purple coalesces with red at the same site (which occurs with rate d) 
it produces a red-grcen pair and when it coalesces with a white at the same site it produces a 
white-green pair. The equation for C/(vi/pp) has therefore the form 

(3.460) ^^%7^'^ ^t/(H^HP)(0Q"^"W, 
where 



PR. 



(3.461) Q^^^(i)= Q'^+Q'^'^' + Q'^^ + Q 

i=WM.,P 

+c(l - u{t))[Q'^^'^ + W^«^^] + cu{t)[Q'^^" + Q^^^] 
+cQP^^ - a{t)I. 

This now completes the set of limiting (N — > oo) equations for the coloured particle system at 
time Tjv + 1. 

For the further analysis it is crucial to observe the following two facts: 

• for calculating the asymptotics of first moments of the McKean-Vlasov system it suffices to 
isolate dual process effects which in the limit N ^ oo are of the order e"* as t — > — oo and 
ignore error terms of order e^"* and higher, 

• for calculating the asymptotics of second moments of the McKean-Vlasov system it suffices 
to isolate effects which are in the limit — > oo of the order e^"* as i — oo and ignore error 
terms of order e'^"* and higher. 

It is easily verified that if we condition on the growth constant W and on W, , we can verify 
that as t — oo, we have the following estimates on the "number of sites" occupied by certain 
colour combinations: 

(3.462) ^ UR+iG+iB>iU{wRGB){t;iw,'iR,'iG,iB) = Oie"^), 

iw ,iR,iG,iB 

(3.463) ^ ltB>i'U(wRGB){t]iw-,iR,iG,iB) = 0{e^°'*-) 
and therefore 

(3.464) URGBit) = 0(e2"*) and usit) = ©(e^"*). 
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We conclude showing that as < — > — oo this difference between the red phis white particle system 
(with WRGB dynamics) and the true dual (WRP) is of order o(e^"*) once we have taken the limit 
N ^ oo. This difFercncc is the difFercncc between the system of purple particles and the system of 
blue particles. 

Again we condition on W and W*. We first note that the intensity (after taking TV — > oo) of red 
particles is 0(e^"*) and therefore the rate of production for collisions of red particles with other 
occupied sites by white, red or purple is 0(6"^"*). Hence we have as t — >■ — oo 

(3.465) up{t) = 0(e3"*), u{t) - u^wRGB){t) = ©(e^"*). 

The sites occupied by red, green or blue particles correspond to black sites and therefore bound 
the number of occupied sites eventually lost (compared to CMJ) due to collisions. What is the 
meaning of the three components? The population of green particles represent asymptotically 
as i — ^ — oo the total number of particles lost (in the dual particle system compared with the 
collision- free CMJ-system) due to collisions up to order o(e^"*). 

Remark 37 It is important to remember that we have seen above in ^S^JJ^, ^S^JJ^ that we can 
obtain upper and lower bounds for the exact dual using white plus red for the lower bound and 
white plus red plus blue for the upper bound and that by {3.465\j we know that the difference of 
lower and upper bound is 0(e"^"*) = o(e^"*). Hence suffices for the purpose of first and second 
moment calculations for the random McKean-Vlasov entrance law to work with the WRGB-system 
on the dual side. 

Step 3: 
Proof of Proposition 13.91 

(a) We define the coupling using the multicolour system where white and black gives the CMJ- 
process and WRP the dual particle system. The evolution rules define the processes in a standard 
way for all N on one probability space using Poisson stream for all potential sites, colours and 
transitions. 

Recall that the total number of sites occupied by black plus white particles can be identified 
with a CMJ process Kt and our processes for each N can be coupled based on a single realization 
of this CMJ process using the multicolour system. Let on this common probability space 

(3.466) W := lim e'^^Kf 

Then the existence of the limit in p.466p and therefore assertion p.375p follows immediately from 
the Crump-Mode-Jagers theory. 

In order to obtain the assertion (j3.376p we shall estimate below the mean and variance of 

(3.467) e-°'*N-'^u^{TN + t) (recah Tn = oT^ logiV) 

by bounding the mean and variance of the normalized number of black particles (equivalently, red 
plus green plus blue particles) and finally put this together to get the claim of the proposition. 

Part 1: Bounds on the mean. 

Define here 

(3.468) Wt = e-"'Kt, WN,t = WT^+t, W = Woo. 
RecaU that by CMJ theory ([N]), and finiteness of E[W'^], 

(3.469) w = sup E[Wt] < oo, uP = sup E[W^] < oo. 

t t 
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We condition now on the path {Kt)t>o (and then in particular W is given) and give a conditional 
upper bound for the number of black particles produced by time Tjv + 1. This bound is given by 
considering the following upper bound for the rate of collisions of white particles at time s which 
is 

(3.470) {aN{s)+iN{s))K^^, 

where aN{-) and 7Ar(-) are defined as in (|3.239p . Recall that that aAr(s) — > a{s), 7Ar(s) — > 7(5) 
and (a(s) + ■y{s))Ks < {a + 7)^^86"" as TV — ;> 00. We get a stochastic upper bound of the quantity 
in (|3.470p by taking the random process 

(3.471) ia + ^)W.e"^^. 

Now assume that we realize a Poisson point process on [0, 00) with intensity measure given by the 
density given in (|3.470p . Now we use this Poisson point process to generate the founders of the 
black families. Then we let a black cloud grow descending from one black ancestor born at time s 
up to time (Tjv + t — s) and this evolution is independent of the Poisson point process. This black 
cloud born at time s grows till time + 1 as 

(3.472) W?A,,3(<)e"(*~") 

and is given by an independent copy of the CMJ process starting with one particle at time s. With 
this object we obtain a stochastic upper bound on the number of black particles in the original 
dual process. 

Since at every time s of birth of a black particle we get a cloud independent of all other clouds 
and also independent of {Ks)s>07 '^6 get an upper bound for the expected number of black particles 
given a realization of {Ks)s>o' 

(3.473) / i?[V^/^.,,_,]e"(^"+*-^)(aJv(5)+7w(s))i^f-^ds• 

Furthermore with the Crump-Mode- Jagers theory applied to {Ks,U{s)) we get setting D = 
E\W]^^ £ (0,00) that the quantity in p.473p can be bounded above asymptotically by (recall 
(l?:i551) "for Wn,s)- 

1 nTN-\-t 

(3.474) —J e"(^"+*-')ai:»VKfe2"'ds as iV ^ c5o. 
The mean of the expression in p.474p is bounded by 

(3.475) w-^aD— J e"(^"+*-^)e2"^ds. 
This quantity in turn is equal to 

(3.476) JVW"*(1 - e-"('^"+*)) = iVL>W"*(l - e""*^). 
Therefore we obtain the upper and lower bound: 

(3.477) we^' > E[^u'' {Tn + t)] > E[WN.t]e''' - »"*(1 - 0(^)), 

where the last expression is an upper bound on the expected number of black particles. 
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Therefore we get the final bound for the mean of the normalized number of sites: 

(3.478) > E[e-°'*N'^u^{TN + t) - WN^t] > cn ■ e"*(l - 0(^)) 

where sup^(|cjv|) < oo. 

Part 2: Analysis of the variance 

To complete the result we now have to estimate 

(3.479) Var[e-"*N''\'^{TN + <) - WnA- 

We will verify as a first step that the variance of the normalized number of black particles converges 
to as A'^ — >■ oo and then we will return to the dual particle system. Recall that in our calculations 
we condition on {Kt)t>o- 

The first step is now to condition again on a realisation of a process which is a stochastic upper 
bound on the number of white particles (compare part 1). This bound is given by a CMJ-process. 
Therefore we observe first that given this number the evolution of the black clouds once they are 
founded are all independent. Therefore we obtain an upper bound on the normalized variance of 
the black particles if we use a path of the CMJ-process, which dominates the white population. 

Note that therefore the birth of new black clouds of particles as — )■ oo (which arises upon 
collision) can by (|3.474|) be bounded by a time inhomogeneous Poisson process with intensity 

(3.480) {c.^ jds. 

where we condition on a realisation of W{t),WN,t = W{Tn + t), or alternatively on the path 

{Ks)s>o. 

Define L^ as (for given r in the variable s) the Laplace transform of e~"'^''~'*^/Cr-s (starting 
with one particle). 

The Laplace transform L of the total number of black particles at time r then is given by 

(3.481) L'-(A) = cxp 1^ D^^^^il L:{Xe"^^-^^))ds^ . 

Note that by construction = Lr-s- Furthermore we have that: 

(3.482) supVar[e-°"'Ks] = supL';(0) < oo. 

s>0 s>0 

Now we apply this to r = T^v + t and conclude that conditioned on W (this is indicated by ~ 
on E, Var, etc.): 

(3.483) yar[e~"*iV"i • (# of black particles at time Tn + t)] 
is bounded by (Const means here a function of W only) 

(3.484) ^ Z?iy2^e2"(^"+*-^) ^^^^^^'(O)^^ < Const ^^^^^f^\ 
The r.h.s. is independent of t. Hence uniformly in t G M, as — > oo: 

(3.485) V"ar[e~"*-^(# black particles at time Tn + t)] = oil). 
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Hence conditioned on W the number of black and white particles normalized by iVe"* is determin- 
istic in the limit — > oo. Hence by p.485p the density of white particles alone and the normalized 
difference between the number of black and white particles is deterministic in the iV — >■ oo limit. 
Part 3 Conclusion of argument 

Since the dual lies between the set of CMJ particles and the set of white particles, conditioned 
on the variable W , in the limit N~~^e~"'^u^ {Tm + t) lies between two deterministic curves, both 
converging to the same constant as t — > — oo, namely W . Using p.477p and p.478p we conclude 
inequality p.376p . 

The assertion p.377p is proved as follows. Since the first component was just treated, we focus 
on the second, i.e. . Note that the total variation distance between {Tjy + 1) and U[Tm + 1) 
is bounded by the normalized number of black particles. Then combining (j3.473p with p.476p we 
have 

(3.486) lim WU'^ {Tn + t) -U{Tn + t)\\ ^ Q. 

(b) is an immediate consequence of the analysis above. 

Step 4: 
Proof of Proposition 13.101 

We proceed in six parts. Four parts prove the various claims and two parts arc needed to prove 
some key lemmata at the end of the argument. 
Part 1 - Proof of (|3.379p 

To verify (|3.379p we use that the u{t), a{t), jit) arise as the limit — >■ oo of " j^^*-* , a'^ (t) , {t) 
for which we can use the representation by the multicolour particle system. For each N the 
inequality 

(3.487) (aNit) + -/Nit))u^it) < (7 + a)NWN^te''* 

follows since the left side counts the number of particles in a subset of the set of particles on the right 
side (the difference is induced by the green particles in the multicolour construction) . Furthermore 
we know that WN,t W, a.s.. Hence the inequalities in (|3.487p are therefore inherited in the limit 
as N 00 and give 

(3.488) ia{t) + lit))uit) < {a + -f)We°'K 
Part 2 - Proof of (|3.381|) . (|3.386|) 

To obtain p.38ip . p.386p we use the coloured particle system introduced in Step 1 of this 
subsection. We show first that k and k* are strictly positive and then later on we identify these 
numbers by a more detailed analysis. 

Recall that the total number of white, red, green and blue particles at time T^v + 1 equals the 
total number, WN,tNe"* in the CMJ process and the collection of white, red and purple particles 
is a version of the actual dual particle system 77^ . We have shown that if we consider these particle 
systems at time T/v + t and let A'^ — > 00 we obtain a limit dynamic. The analoguous statement 
holds for the limiting dynamic as — > 00, now with t as the time variable for the multicolour 
system. The techniques are the same as used in Subsubsection 13.2.81 and we do not write out here 
the details again. We are now interested in the expansion as i — > — cx3 in this multicolour limit 
dynamics. 

In contrast to the proof of Proposition 13.91 the argument here now involves the green and blue 
particles. 

The purpose of the green particles is to keep track of the particles lost due to collisions of white 
with red and purple particles (lost by coalescence). The purpose of the blue particles is to obtain 
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with the WRB-systcm an upper bound for the total number of particles in the W^i?P-systcm by 
suppressing the loss of particles that could occur if a red or purple particle migrates to an occupied 
site. We observe that therefore that since the number of blue particles is 0(e'^"*) (recall p.464p ). 
then in order to identfiy the e^"* term in the total number of dual particles as t — )• — oo, it is 
sufficient to control the asymptotics as t — > — oo of the green particles. Therefore we now have to 
study the green population, for which we first need more information about the red particles. 

The rate at which white particles migrate and collide with occupied sites thus creating a red 
particle is given and estimated as follows (recall (|3.488p ): 

(3.489) {aw{t) + -,w{t)Ww{t) < {aw{t) + ^w{t))u^{t) < W^a + 7)6^"*. 

At the particle level, once a red particle is created it begins to develop a growing cloud of red sites 
which grows with exponential rate < a. Using the latter and p.489p we obtain 

(3.490) UH>iUwRp{t,iw,iR,ip)^0{e^''*). 

hv ,iR,ip 

Return now to the production of the green particles. We will show first that the number of 
green particles and sites with green particles is of order e^"* as t ^ —00. 

We can assume that when a red particle is created on an occupied site the number of white 
particles is given by the stable age distribution by p.458p . (We will verify below in (|3.559p that 
in fact the error in p.458p is 0(e"*).) We now want to study the production of sites with green 
particles. We introduce the concept of special sites for this purpose. 

We briefly return to the finite A^-system. We call occupied sites (by a white particle) in the 
coloured particle system 

(3.491) "special sites" at time s 

if s lies between the (random) time when a first red particle arrives at this site until it contains no 
red, purple or green particles. This random time is a.s. finite. 

We transfer this concept to the N 00 limit at time + 1. This means that we incorporate 
the additional mark in the measure-valued description. 

During the lifetime of a special site a special site can produce red, purple and green migrants. 
Special sites are created at rate (recall for the first equality sign that a, 7 are the limits of a{t), j(t) 
as t — > —00, i.e. the ones characterizing the collision-free regime and then use p.464p to get 
o(l)0(e^"*) as bound for the difference of both sides): 

(3.492) (awit) + 7w(i))"w(0 = + iWit) + 0(6^"*) = W^ia + 7)6^"* + 0(6^"*) 

and therefore by integration from —00 to t the number of special sites created up to time t is 
bounded below by 

(3.493) W^^^e^"' + o{e^^'). 

2a 

Then green particles at a special site are produced at rate 

(3.494) d ^ iw{iR + ip)UwRp{t;iwjRjp) > d ^ ii„(iii+ip)>iUwRp{t;iw ,^R,^p)■ 

iw,iR,iB iw,iR,ip 

Since iw{iR + ip) > 1 automatically at a special site we use p.492p and (|3.493|) . to get that the 
expected number of green particles at time t is bounded below by 

(3.495) d- VF2^-t^e2"* + o(e2"*). 
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Provided the limit exists (see below), this implies that (recall the green particles describe the loss 
of the WRP-systcm compared to the W-BL-systcm) 

(3.496) lim e-^°'*[{a + -f)We'^^ - {a{t) + -/it))u{t)] = k* > 0. 

f — oo 

In order to now get also information on k, we need information on sites rather than on particle 
numbers. In order to get that the number of sites with only green particles is of order e^"* as 
t — ^ — oo, we argue as follows. Since there is a positive probability that a green particles migrates 
before coalescing with a red or white, this implies (if the limit exists, see below) that 

(3.497) lim e-^"'[{We"' - u{t)] ^ k > 0. 

Hence we now know (provided that the limits taken exist) that: 

(3.498) K,K*>0. 

Next, in order to identify the constant k, k* we need to obtain the 2nd order asymptotics. It 
remains therefore to determine the actual value of k, k* using this information. 

We first consider the production of blue particles. Since there are 0(e^"*) special sites and a 
blue particle is created only when a migrant comes from a special site and hits the set of size u{t) 
of occupied sites, it follows that the number of blue particles produced is of order 0(e"^"*) and 
hence indeed the blue particles and therefore also the purple particles play no role determining the 
e^"*-term. In particular the production of green particles by the loss of purple particles is of order 
0(e'^"*) and can be omitted. 

We first obtain an expression for k*. Since the green particles represent the particles lost 
in the M^i?P-systcni due to coalescence of red or purple with white particles, this is obtained by 
considering the growth of the green particles in more detail. We note that the loss of purple particles 
is of smaller order than the loss of the red particles as mentioned above. As a result, in order to 
to identify the constant k or k* in the expressions for u{t) (number of sites) or {a{t) + "f{t))u{t) 
(number of particles) we can work with the WRGB-system instead of the WRP-system. Hence 
in carrying out the analysis using the WRGB-system we obtain a lower bound for the number of 
particles lost but as shown above the resulting error is 0(e'^"*). 

Once a new green particle is produced by a white-red coalescence at a special site we are 
interested in the number of green particles that migrate before the end of the life time of the special 
site. As noted above we can assume that when the red arrives the number of white particles is given 
by the stable size distribution. The number of green particles at the special site and the process 
of producing green migrants can then be obtained by the analysis of a modified birth and death 
process where we now have two types, namely, red and green with birth and death rules inherited 
from the WRGB dynamics. In terms of the limit process this involves the forward Kolmogorov 
equations for this modified birth and death process which serves as a source of migrants for the 
green particle system which then evolves by the CMJ dynamics. 

To summarize, in order to obtain the distribution of green mass up to an error term of order 
0(e'^"*) we consider 

• the WRGB-system instead of the WRP-system, ignore blue particles 

• the production of special sites by red-white collision, 

• the production of green particles and green migrants at a special site, 

• a CMJ process with immigration, with immigration source given by the green migrants from 
special sites. 
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Recall from the derivation of a lower bound on k above that in the W RGB-system red always 
migrates to a new (unoccupied) site or otherwise a purple-blue pair is created so that the number 
of white plus red and blue particles gives an upper bound to the original interacting dual particle 
system in the limit iV — > oo. The number of the green particles produced only by white-red 
coalescence (omitting purple- white coalescence) gives up to an error of order 0{e^°'*) a lower 
bound to W{a + 7)e"* - (a(t) + j{t))u{t). 

We make the following definitions in order to turn bounds into precise asymptotics. We say 
below "expected", to distinguish from the usual expected value, when we calculate quantities of 
the form ^ icUlt, iyy, ifi.: *Gi "i-B, ip), where the sum is overall Ia, A € C. Let 

(3.499) 5i(t,s) 

be the "expected" number of green particles at time t at a special site created at time s. Note that 
a special site has a finite lifetime (with finite expected value) since due to coalescence it will revert 
to a single white particle at some finite random time after its creation. Therefore the function 
gi{t, s) = 5i(i — s) is bounded. 
Let 

(3.500) g2{s,r)dr 

be the rate of production of green migrants at time r at a special site created at time, i.e. the c 
times the "expected" number of green particles at special sites. This function is also bounded as 
above. 
Let 

(3.501) 53 (r, <)< const •e"^*-''^ 

be the "expected" number of green sites produced at time t from a founder at time r and finally 

(3.502) 53*(r,t) < const-e"('-'') 

is the "expected" number of green particles produced at time t from a founder at time r. These 
four functions determine the numbers k and k* uniquely as we shall see next. 

Now the creation of the first green particles at a site and then subsequently green sites occurs 
from a red-white coalescence at a site. At such an event a green particle arises and if there is no 
green particle yet a new green site is created at this moment. From these founders now a cloud 
of green particles and sites develops. The evolution of these clouds is independent of the further 
development of the number of white particles and white sites. Therefore conditioned on W we get 
that the "expected" number of green particles at time t, denoted is asymptotically as i — > — oo 

(3.503) 1^2^*6^"* -f 0(6^"*), 
where k* ~ Van and 

t— f — CXD 

ds + 0(e"*). 

Since gi and g2 are bounded, the integral is finite. Observe that gi(t, s) ~gi(t — s) and g2{s, r) = 
52 (j* — s),53(r, t) =]j^(t — r) by construction of the dynamic of the green particles, which do not 
coalesce with white or red particles. Therefore the first term in p.504p is independent of t. The 
second part is given by an integral depending again only on t — s and hence the complete term 
again does not depend on t. This implies the convergence of Kj as t — > — oo. 



(3.504) 



-2at 



{a + 7)e 



2as 



5i(*,s)+ / 92{s,r)gl{r,t)dr 
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Wc now turn to the identification of k that is wc turn from particle numbers to number of sites. 
As in the identification of k* we can show that there exists n such that 



(3.505) uwR{t) = We° 



0{e 



We obtain the constant k by counting green sites (i.e. sites which are not also occupied by only 
red or white) instead of green particles. Conditioning on W we get the "expected" number of green 
sites, at time t is asymptotically as i — > — oo: 



(3.506) W^^^e^"* + 0(e 
where k = lim Kt with 

i— > — oo 



3at\ 



(3.507) 



-2at 



{a + 7)e 



2as 



92is,r)g3{r, t)dr 



ds + 0{e 



3Qt\ 



The existence of the limit follows as above. 

Part 3 - Proof of (|3.380p . (|3.382p . (|3.389p 

Next we return to our original nonlinear equation, which we now relate with the quantities 
of our multicolour particle system. We will need to collect in (|3.509p - p.514[) some facts on this 
system used in the proof. 

We first note the relation between the original dual and the WRP system and the possibility 
to replace them with the WRGB-system for the asymptotic as t ^ — oo, namely as t ^ — oo: 



(3.508) u{t) = tiwRp(t) = u^^{t) + 0(e-^"*), 



(3.509) U{t,k) = ^ Uw+iR+ip=kUwKp{i,iw,iR,ip) 

_ _wrgbO^ lj„,+iH=fct/wRGB(i; ifl, ic, is) + o(l). 

u(t) 

Here the first factor arises since the normalization (by the number of sites) for the WRP system 
is u(t) and it is uwrgb(0 fo'" the WRGB system. We also recall that by construction 

(3.510) MWR < W WWRP < ^^WRB- 

Recall furthermore that the sites occupied by black particles are in one-to-one correspondence 
with the sites occupied by the red, green and blue particles and the number of white plus black 
sites is Wje"* with Wt ^ W s&t ^ oo. Therefore the difference in the number of occupied sites 
in the (W-BL)-system and the WRGB-system arises from white particles sharing a site with the 
coloured particles, but those sites are represented as two sites in the (W-BL)-system. Hence 

(3 511) '"wrgb(^) _ S»„,iH,»G,ifl [lzw+»fl-H»G-H»B>i]^WRGB(^, ^w ^R, ic, is) 

W^e"* Y.i„,zi,,iG,iB\^^«^>'^^^iR+io+tB>l]UwKGB{t,iw,iR,iG,iB)' 

Furthermore the estimates on the number of green, red and purple particles in the part 2 of 
our argument for Proposition 13.101 imply as well 

(3.512) "wrgb(^) =i + ^|yeOt + o(e2a«) 

WWRP(^) 
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and 

(3.513) a{t) = !™^(a^^(t) + aR{t)) + 0(6^"*), 

(3.514) j{t) ^ ^wrgbW {U^+.,„^,UwKGB{t,^w,^B.^G,^B)) + 0{e''^'). 

Using the equations p. 508113. 514p we shall show below that (conditioned on W) we have the 
following approximation relations of i — > — oo for u, a, 7 and J7, which finish the proof of (j3.380p . 
(13:3821) and (l3386l) : 



(3.515) 


\u{t) 


-T^e"*! =0(e2"*), 


(3.516) 


\a{t) 


-a|=0(e"*), 


(3.517) 




-7l = 0(e"*), 


(3.518) 


El 


^ ^ liH+i«'=fc 




k=l 





:Uw-RGBit,iw,iR,iG,iB) -U{(X),k))\ = 0(e"*). 

It therefore remains now to verify p.515p - (j3.518l) in the sequel. 

The bound (|3.515p follows from the fact that the difference between the W-BL-system giving 
M^e"* and the WRP-system giving u{t) is bounded by the RGB-system which satifies u^Qsit) = 
0(e2"*). 

Turn next to the proof of (j3.516p . First note that the bound of the WRP-system (for which 
a{t) is the rate of colonization of new sites by multiple occupied sites) from below and above by 
the WR-system respectively the WRB-particles from the WRGB-system yields by dividing the 
inequality by uwrp{~ u): 

(3.519) + < a{t) < "™f^V v^(0 + an{t) + anit)) 
wwRp(r) wwRp(r) 

The difference between the first and third expressions in (|3.519p is 0(e^"*) (bounding asit) by 
0(e^"*) and using p.512p ). Therefore we get using the representation of a{t) by the WRP-system 
(compare (|3.513|) '): 

(3.520) a{t) = ^ [lnv>i ("i-w) + Unr=i,iR>o{-iw + ir) + I 



TT ij- ■ ■ ■ ■ \ "WRGB(i) , ■2at\ 
■UwRGB{t,lW,lR,lG,iB) ■ —:- + 0(e ). 

We can obtain the expression for a, which is suitable for the comparison with a{t), as follows. 
This constant a arises from the (W-BL)-system. Furthermore the RGB-particles are in correspon- 
dence with the black particles, only sit always on the copy of N instead of N or {1, • • • , N}. (Note 
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that wc arc interested here on particle numbers!) Therefore since W — BL is a CMJ-system in the 
stable age-type distribution 

a= uwRGB{t){We''')-^ 

( S [lni.->l*W + liw-=ol4K+iG+iB>l(*-R + *G + *b) 

(3.521) ^w,^a,^G.^B ...... 

+ li„^i^ij^+i^+igyo\l'W + IR + «G + *b) 

+ li»'>i(*-R + «G + «s)]C^wflGs(i, iw,iR,iG, 

Therefore wc can represent a(t) ~ a + a{t) and by combining p.520p and (|3.52ip together with 
the fact that blue particles are ©(e'^"*), we get an expression for a{t) which together with p.512p 
results as t ^ —oo in 

(3.522) a(t) =a + 0(e"*), 

which proves p.516p . (See Lemma [3. 151 b elow for the characterization of 5(t).) 

Similarly we can proceed with the remaining claims p.517p and p.518p . This completes the 
proof of ([X55n)) . (p:^ and (jX^ as pointed out below ^MM- 

Part 4 - Proof of (|3.384p and (|3.385p 

For this purpose we must investigate the difference between a(t), U{t), 7(i) and a, U{oo), 7, 
respectively more accurately than in the bounds above. Namely we need an expression with error 
terms of order 0(e^"*) resp. 0(e^"*) in p.515p resp. p.516p - p.518p . For that purpose we use 
again the multicolour representation. We begin by establishing the order of these differences rather 
than only upper bounds. This is based on the coloured particle system. From the above discussion 
wc expect that a{t) = a{t) — a age"* for some > which we have to identify. 

Recall that the intensity of green particles at time t corresponds to 

(3.523) (a + 7)e"* - (a(0 +7(0)w(i) 

and the number of green particles is given W of order 0(e^"*) so that conditioned on W we have 

(3.524) (a + 7)M^e"* - {a(t) + -/{t))u{t) > 5e^'^\ -00 < t < to, 

for some positive constant 6. 

We want to sharpen this inequality above to a precise second order expansion. To do this and 
to thereby identifying Sq , 70 we now return to the analytical study of the nonlinear system (|3.414p , 
p.415p and prove the following. 

Lemma 3.15 (Expansion of a{t) and^[t)) 
(a) Let 

(3.525) a{t) := a{t) - a, j{t) ^ j{t) - 7. 
Then as t —> —00 we have 

(3.526) 5(t) =5oe"* + 0(e2"*), 

(3.527) 5o > is given explicitly by \3.569\) and is of the form Const ■ W . 
Moreover, 

(3.528) U{t) = U{t) - U{oo) = Uoe"' + ©(e^"*), 
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where Uq is given explicitly in Jg. 575\ l and is of the form Const ■ W. Then 70 — Uo{l). 

(h) Define u* as the solution to the nonlinear equation \3.414^ with initial condition at ~ 00 

(3.529) lim e-"*u*(t) = 1. 

Then as t ^ —oo we have the second order expansion: 

(3.530) u*{t) = e"* - (fo - ^)e2«t + 0(e3"*), 

a 

where ctg is obtained from by setting W ~ 1 in the formula i3.569\} . □ 

With these results wc obtain immediately (|3.386p . This would complete the Proof of Proposition 

mm 

It remains to prove Lemma 13.151 To prove this we need some preparation we do in the next 
part. 

Part 5 - Statement of proof of Lemma 13.161 

We and formulate a statement that gives an explicit representation of u* (and hence of the limit 
of suitably shifted in terms of a{-) which is exact up to the third order error terms: 

Lemma 3.16 (Identification of u in terms ofu) 
Consider the solution to the ODE 

(3.531) ^^^^^a{t)[u*{t)-b{t){u*{t))\ -oo<t<oo, 

with boundary condition at —00 given by lim e~"*ti*(t) = 1 and with the abbreviations 

t— >-— 00 

l{t) 



(3.532) bit) = 1 + 



a{ty 



(Recall that the general case is obtained by a time shift by if 1 is replaced by W and a{-),b{-) 

are shifted accordingly.) 

Define with & = 1 + (7/a) the function u on R; 

g^at^Jl^{a{s)-a)ds 

(3.533) u{t) = w ^ — ~°° < * < 00. 

(a) Then 

(3.534) |u*(i)-u(t)| =0(e3"*). 

The function u satisfies (here again 6 = 1 + ^ J 

(3.535) u{t) ^ ^ ^^^t as t -00. 

(b) We have for Tn — cx."^ log that in distribution: 

(3.536) l.-(T. + t)=^u*(t + i^^)=2(t+i^) + 0(e-*). □ 

N N^oo a a 
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Part 6 - Proof of Lemma 13.161 

(a) Wc start with the foUowing observation. Since by the coloured particle calculation we 
obtained earlier in this proof, see (|3.516|) . (|3.517p . 

(3.537) |7(t) - 7| + \a{t) - a\ < const • e"* 

and a > 0, then 



(3.538) \b{t) -b\ = \b{t) - (1 + -)| < const • e 



a 



When b{-) is not constant wc cannot obtain a closed form solution of (|3.531|) . However we can 
obtain an approximation that describes the asymptotics as t — > — oo accurate up to terms of order 
0(e2"*) as follows. 

Let 

(3.539) {uto{t))tm 

denote the solution of the modified equation (|3.53ip where we replace our function b{t) by the 
constant b and put the initial condition ut^, at time to- 

The solution of the modified equation is given by the formula 

U \ ft a{s)ds 

(3.540) uto(t) = , t>to. 

l + 6u*„(M(e^'«"^^^'^-l) 

Now let 

(3.541) ut, = e"*°, 
so that 

(3.542) ut,{t) = , t > to. 



We then get with p.535p that 



(3.543) u{t) = lim «*„(<) = w ^ — -oo < t < oo, 

noting that the integrals are well defined by p.537p . Hence u satisfies a differential equation 
( (|3.531[) with b{t) = b) which wc now use to estimate u — u* . 
Let v(t) := {u{t) - u*{t)). Then v{t) satisfies 

(3.544) = 5{t)v{t) - b{u{t) + u*{t))v{t) + {b{t) - b){u*{t)f. 

Using p.537p . p.538p and the fact that u*{t) < const • e"* we then obtain that 

(3.545) < const • e"*w(t) + ©(e^"*). 
at 

Therefore \v{t)\ < const • e'^"* and hence 

(3.546) \u*{t) - u(t)\ < const • e^"* as t -oo. 



3 PROOF OF RESULTS 



117 



Wc conclude with an estimate for the integral term in u 

(3.547) {a{t) + -f{t))u{t) 

Furthermore by (|3.524p . 

(3.548) {a{t) + -f{t))u*{t) < {a + 7)e"*. 

Then solving in p. 5471) for the e-l^ -term we obtain with p.534p for t — oo the relation: 

(3.549) lit) = e-''--^"^")""^''-' < 5— = 1 + 6e"* + o(e"*). 

(b) The second assertion follows from part (a). The first assertion follows from p.312p . 
This completes the proof of Lemma 13.161 

Part 6 - Proof of Lemma \3AE\ 

We separately show the different claimed relations in points (l)-(3) for 5,7 and then u* . 
(1) Relation S3.526\) and {3.527^ 

Here our strategy is to express di{-) in terms of the rewritten (u, U) using the Markov process 
generated by Qq and the operators Qi and L. Using (|3.422[) . and p.572p we can rewrite the 
nonlinear system defining (u, U) in the following form. Let U{t),QQ,Qi,L be defined as in (|3.415p , 
(Emi), (jXiTHl) and define ly by 

(3.550) j.(j)=jl,>i. 

Furthermore recall that by setting a = a we get the following: 

(3.551) {S")t>o = semigroup with generator Q" ^ Qq + alj^i. 

This semigroup has a unique entrance law from t -> — oo, since it is standard to verify that S" 
defines an ergodic Markov process. 

Next let {U°'{t))t£R solve the (forward) equation for generator Q", i.e. 

(3.552) — C/"(i) [/"(t)Q", teR, 
at 

and define the difference process {U{t))t^^ (we now suppress the superscript a in U): 
U{t)-U°'{t). 

= t/(i)Qo + m[U{t)Qi + l,=i] + u{t){a{t) + i{t))[U{t)L - l,=i]. 
We can now represent due to the definition of «(•), [/, [/", a{-) the function a{t) as 
(3.555) a{t)~a^<cU{t),v> . 

Therefore (recall p.550p ) by the formula of partial integration for semigroups (for v see p.550p 
with the semigroup 5" of [/" as the reference semigroup and then U as the wanted object: 



(3.553) U{t) = 
Then 

dU{t) 



(3.554) 



dt 



3 PROOF OF RESULTS 



118 



(3.556) a{t) = a{t) - a = {cU{t),v) 

= f c[5(s)((t/(s)Qi + l,=i)5r_„i^)+^.(5)(a + 7)((t/(s)L-lj=i)^r-„j^)]ds. 

J —OG 

Next set 

(3.557) U{^)^{pi,p2,...) 

and note that by explicit calculation (see p.418p . ^.419^ ): 

(3.558) U[oo)Qi + lj=i = (1 -pi, -p2, -P3, • • •)> U{oo)L - = (-l,pi,p2, • ■ •)• 
Note that for t —oo: 

(3.559) lim C/(t) = Zi(oo), ||W(oo) - C/(t)||i 0(e"*), 

where U{oo) is the stable size distribution of the McKean-Vlasow dual process where the norm 
II • 111 is as in p.263p . Hence we get from p.556p that 

5(t) = f c[a{s){{U{^)Qi + lj=i)Sf_,,v) 

J —OO 

+u{s){a + i)m{^)L - l,^,)Sf_„v)]ds 
+0(e2"*). 

Next note that (recall p.550p and p.558|) . (|3.557p and the fact that the Markov process for S*" 
increases from the initial value 1 stochastically to its equilibrium) we have the relations: 

(3.560) - a < ((W(oo)Qi + lj=i)5f , i^) < and {{U{oo)L - lj=i)S'f , v) > 0. 
Recall furthermore that 

(3.561) {U{oo)S^, v)^- for all t > 0. 

c 

Let now (use p.560p for possitivity): 

/>oo 

(3.562) Ai^c {{U{oo)L - lj^i)e-°"' Sf ,i^)ds > 0, 

Jo 

/>oo 

(3.563) A2 = -c e"""((Z^(oo)Qi + lj^i)Sf,iy)ds > 0. 

^0 

Now multiply through both sides of (|3.556p by ae~"* and set 

(3.564) 5{t) = e-"*5(t), = e-"'S^. 
We obtain the equation: 

(3.565) a{t) = c[ [a{s){{U{oo)Qi+lj^i)St,,v) 

J — OO 

+u{s){a + 7)((Z^(oo)L - lj=i)S?_^^,iy)]ds. 
This is a renewal-type equation of the form (choosing suitable positive functions /, g, recall p.560p ): 

(3.566) a ^ a * (— /) + u*g, 
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where the integral over / over R is equal to A2, which is less than 1 due to relation (|3.56ip and 
p.564[) . furthermore u{t) W as t ^00 and g is an integrable positive function with integral 

Wc now claim that as t — > — od 

(3.567) S(t) — ^ C and \5(t) - C| = 0(e"*), 

where C is calculated as usual in renewal theory. 

Then the solution (uniqueness is verified below) to p.566p is given by 

(3.568) a{t) = 5oe"* + 0(6^"*), 
where So > is given by 

(3.569) ao ^ W. 

I + A2 

The fact that the errorterm in p.568p is of the form 0(e^"*) follows from (|3.560p . Moreover 
()3.568p is the unique solution of order 0(e"*) to (|3.556|) . To verify the uniqueness we argue as 
follows. 

The difference of two solutions hi, h2 must solve according to p.566p that 

(3.570) (/ii - /i2) = (/ii - /i2) * (-/). 
We can then verify that either hi = h2 or 

(3.571) \ihi - h2)it)\ < sup \hi{t) ~ h2{t)\ 

(-00,0) 

and therefore indeed /ii — /12 = 0. 

This completes the proof of p.526p and (|3.527p . 

Remark 38 We have with \3.246^ and LS. 288\) the following ODE for a{t) as function of (u, U): 
(3-572) ^= |[cf:Mt,j)]=cf: ,(|f/(U-)) 

3=2 j=2 

00 

= [«(i)(l - "W) - litHt)] 

+u{t){a{t)+^{t))[lj^iU{t,j - 1) - U{t,j)] 
+c{j + W(tJ + ^)-cjU{t,j)lj^i 
+sU - '^)h^iUit,j - I) - sjUitJ) 

+ ^(J + niU{t,j + 1) - - l))l,^iUit,j) 

-[ait)il~b{t)u{t))) -Uit^j)]. 

Collecting terms we obtain: 

(3.573) ^ =^s-ait) + ^){ait)+^it))-'-^Y.fUit,j) 

+u{t){a{t) + j{t)f + cu{t){a{t) + 7(t)). 
Hence we can determine a as solution of an ODE once we are given (u, U). 
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(2) Relations \3.528\) and \3.385]) . 
Recall (|3.554|) which yields 

(3.574) = U{t)Q'^ + a{t)[U[oo)Qi + l^-^i] + u[t)[a + ^)[U{oo)L - + ©(e^"*). 
Then using the expression p.568p for a{t) and u{t) = M^e"* + 0(e^"*) we obtain 

(3.575) C/o= / e"-^S'^,Fds + 0(e2"*) 

where V = M^(5!o [Z-/(oo)Qi + lj=i] + (a + j)[U{oo)L — lj=i]). Recalling that is a semigroup on 
7'(N), it follows that the integral is well-defined. 

(3) Relation i3.530\) . 

With the knowledge we have now the expression p.530p then follows by p.534p by substitution 
with ([3368| in ([3335| . 



3.2.10 Weighted occupation time for the dual process 

In this subsubscction we focus on the behaviour of the quantity SAr(a~^ log + t, 2) in the limit 
iV — > oo as a function of t. We know that this limit is £t(2) which is a random probability measure 
on [0,1], in order to identify its distribution it suffices to compute the moments via duality and 
p.320p . Recall also that the weighted occupation time of the dual determines the probability that 
no mutation occurred changing the value at the tagged site from type 1 to 2. Why, what do we 
have to prove about the dual process? This we now explain first. 

Consider Tn = a^^ \ogN and toiN) t oo, to{N) = o(logiV). Recall that 

(3.576) nf'^-''= i^w+^R + ^p)'^si^w,^R,^p), 

where n^^*^'' is given by the dual particle system started with k particles at each of ^ sites. 

Then we know that we only have to replace W by W'^^'^^ and otherwise we get the same equation 
as we had for fc = ^ = 1. Hence we have 

r{t+TN) 1 i-t 

(3.577) / [TV-^nf^^^Vs - - / {a^^^'\s) + -i^^^^\s))u'^^'^\s)ds, as iV ^ oo 

Jto{N) C 

and we need the r.h.s. in the limit as i — > —oo. Integrating the expression given by p.386p we get 
the following expansion: 

(3.578) /* (a(fc,€)(s)+7(fc,£)(s))ii(fc,^)(s)ds = W^('^'^)-^^i^e"*- — (VF('^^^')2e2"* + 0(e3"*). 
J-oo a 2a 

Recall that in terms of the normalized solution of the equation for the dual (w*,C/*), (i.e. 
e"*u*(t) — >■ 1 as t —oo) we have by the above on the one hand 



(3.579) 



^{iw I n^''''du] \ =^ Cla^-^ ^Ms)+jis))u*{s)ds 
[ to(JV) J l- 

-ii'fe.Ki) eP([o,oo)) 



and on the other hand in terms of the original process we have (cf. p.ll8p . p.597p ). 



(3.580) E 



x''Ct{2,dx) 



e"'"Vfc,f(t,dy) for anfc,^e 
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Note that the random time shift by plays an essential role. The interplay between this 

shift and the nonlinearity of u{t) determines the distribution of Ct{2). To determine this effect we 
study the properties of the r.h.s. in the two equations above. In particular we want to show that 
£f(2) is neither Sq nor Si and we want to show in fact that it is truly random and has its mass on 
(0, 1). This can be translated into properties of the r.h.s. of p.580|) which we will study using the 
dual. 

We next collect some key facts needed to calculate the probability of mutation jumps as ^ oo 
which appear on the r.h.s. of (|3.580p . 

Lemma 3.17 (Properties limiting dual occupation density) 
(a) If c > the limit object Vk/ satisfies: 

(3.581) i.M(i,(0,oo)) = 1, 
and 

(3.582) lim Vn.i{t, [K,oo)) = 1, for all K. 

n— >oo 

(h) Denote by < strict stochastic order of probability measures. Then for c > 0, d > 0; 

(3.583) i^k+i,! > i^k.i 

(3.584) iyi,2{t) <iyi,i{t)*'^i,i{t), 

where * denotes convolution, 
(c) Let no G N and set 

(3.585) i„„ (e, N) inf {i : nf -""-^ = [eN\ } - - log N. 

a 

Then every weak limit point (in law) as N ^ oo, denoted i„o(£) satisfies: 

(3.586) tnai^) ~^ ~oo in probability as no — > oo. 
Furthermore 

(3.587) tng{£) — > — oo in probability as e — > 0. □ 

Proof of Lemma 13.171 

Proof of (a). The claim z/fc,f(t, {0}) = follows from p.381|) since W''''^ > a.s. and 
Vk,t{t, ({oo})) = since for T/v = logN and every i e M we have: 



(3.588) lim sup £; 

Af-s-oo 



1 rTN+t 

— / n^^'^^'^du 
nJo ^ 



< oo. 



Therefore i'k,e is a probability measure concentrated on (0,oo). 

The relation p.582p will follow from the argument given for the proof of (j3.586p in (c). 

Proof of (b). The first part of (b) is immediate by a coupling argument realising the (fc + 1) 
and k particle system on one probability space. The second part follows from the fact two typical 
individuals picked among the descendants of the two populations interact through coalescence with 
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positive probability before they jump since once they both occupy 0{N) sites, so that we have a 
fraction of both populations overlapping. This proves (b). 

Proof of (c). The main idea in the proof of (c) is that, asymptotically as uq — > oo, starting 
no particles at a tagged site the number of particles at this site decreases to 0(1) in time 1 but 
during this time still produces O(logno) migrants as we shall see below. Therefore if the number 
of initial particles, uq increases to infinity, then an increasing number of populations start growing 
like Wic"'* and they do so independently. Hence writing the total population as 



(3.589) 



m(no) 

E 



e 



a{t+a^^ log Wi) 



we see that if m{nQ) diverges as tiq ^ oo, then the total number of particles reaches 0{N) at 
a time (^logA^ + tng) where i„Q decreases as uq increases to +oo. Hence it remains to give the 
formal argument for the fact that m(no) is of order lognp as no — >■ oo, which runs as follows. 

Consider Kingman's coalescent {Ct)t>o starting with countably many particles . Consider the 
number of particles in a spatial Kingman coalescent which jump before coalescing at the starting 
site. This is the given via the rate of divergence of the entrance law of Kingman's coalescence from 
0. Hence we need the rate of divergence 



1 

(3.590) J \Ct\dt - I log<5| as 5 ^ 0, 



which follows since the rate of divergence is wellknown to be 6~^. We connect this to an initial 
state with no particles which is obtained by restricting the entrance law of Kingman's coalescent. 
We then see that the number of migration jumps diverges as no tends to infinity. 

To get p.587p we use that tno{e,N) can be bounded from above by the time where H°°'""'^ 
reached first [eA^J. By the analysis of {KtXt) we know that for this it suffices to show that 
T^'"°{e) — > — oo as e — !• 0, which follows from the exponential growth of Kt immediately. 

This completes the proof of the lemma. 



3.2.11 Proof of Proposition 11.121 Part 1: Convergence to limiting dynamics 

In the proof of Proposition 11.71 it sufficed to work with the expected value E[x^ {^logN + t)] 
(since < xi < 1). However in order to prove Proposition 1 1 . 1 21 it is necessary to work as well with 
higher moments in order to identify not only the time of emergence but to determine the dynamic 
of fixation. The strategy is to carry out the following two steps next: 

1. show that weak convergence of {'E}^^'°' {t))t>to to the limiting deterministic McKean-Vlasov 
dynamics for t > to follows from the weak convergence of the marginal distributions at tg^ 

2. prove that the one dimensional marginals {'B}^^'°' (to)) converge weakly to a random (not 
deterministic) probability measure on [0, 1]. 

Step 1 Reduction to convergence of one- dimensional marginals. 
In order to prove that (S'^^'"(t))igR converges weakly to a random solution of the McKean-Vlasov 
equation we argue first that it suffices to prove that the one-dimensional marginals converge. 
Namely if one has that, then one can use the Skorohod representation to get a.s. convergence 
on some joint probability space. Then however we can use that (using duality to obtain a Feller 
property) 

(3.591) s'°s^"(io) ^ /^o, 
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implies {'E.^^'°' (t)}tQ<t<to+T converges to a solution of the McKean Vlasov equation (|1.18p with 
initial value fig- 

This follows by noting that we claim here a standard McKean- Vlasov limit of an exchangeable 
system satisfying the martingale problem (|1.3p . (jl.4p and with initial empirical measures converg- 
ing. The proof is a modification of the standard proof of the McKean- Vlasov limit (for details see 
proof of Theorem 9, [DG99|). 

Step 2 Convergence of marginals: reformulation in terms of the dual. 
We now prove the required convergence of the one-dimensional marginal distributions. To do 
this we recall first that moments determine probability measures on [0, 1] and that therefore the 
collection of "moments of moments" of a random measure denoted X on [0, 1] given by 

(3.592) E[Y[ / x''^X{dx)], fc„meN, 

i—l 

determine the law. Compactness is automatic so therefore it suffices to verify that for all k,( G N 
the (k, £) moments of the mass of type 1 

(3.593) ml,{t)=E{[[ x^E'°/^"{t,l){dx)Y), Vfc,^eN, 

J[o,i] 

converge as — > oo to conclude weak convergence of the one-dimensional marginal distributions. 

In order to analyse the empirical measure and its functionals we return to the original system 
on the site space {1, 2, • • • , N} and express the quantity through moments of observables of the 
system. These moments are obtained in terms of the dual process by considering the dual with 
initial function generated by taking the fc-product given by ^{i} ® . . . ® l{i} at i not necessarily 
distinct sites. 

This representation can be simplified a bit. Namely note that for bounded exchangeable random 
variables (Xi, • • • , Xjv) one has: 

1 ^ 

(3.594) ;^^^E[{-Y^X^Y]^E[ n ^^]- 

Therefore to compute limjv_i.oo £^[/[q 2^'^2^^'"(i, l)(c?a;)]^ we use the dual process {'qn^t)t>o 
where the particle system 77 starts in configuration 770 with k-particles at each of i distinct sites 
i = 1, ■ ■ ■ , £ and the function- valued part starts with 

e k 

(3.595) ^o+-(8)(8)(l{i})- 

j=i i=i 

Let 

(3.596) n^-'^'^ 

denote the resulting number of dual particles at time u. 

Recall that the mutation jump to type 2 (from type 1) is given by ^{1} — > 0. Recall furthermore 
that the selection operator l^^ = 1|2} preserves the product form (note for every state / before 
the first mutation 1 — 2,/l(2) = 0) and just generates another factor ^{i] with a new variable. 
Therefore the dual (before the first rare mutation 1^2 occurs) is a product of factors Ijij. 
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and this product integrated with respect to /if" with /io({l}) = 1 is or 1 depending on whether 
or not a mutation jump has occurred. Therefore 

(3.597) g{t) P[no mutation jump 1 — > 2 occurred by time T/v + t] 

TN+t 



Hence in order to prove that the foUowing hmit exists: 

(3.598) rukjit) = hm m^^{t), 

N-^oo '' 

it suffices to prove that the foUowing holds: 

t^fc,£(t) e 7'([0,oo]) , i/fc,<?(0((0,oo)) = l, for every /c,^eN 
and then we automatically have as well the formula 

/•OO 

(3.600) muAt) = \ e-"'yiykAt){dy)- 

Jo 

The existence of the limit p.599p is a result on the dual process which we proved in Proposition 
13. 8[ part (|3.320p . This will be used in the next section to prove that we have convergence to 
a random McKean-Vlasov limiting dynamic which is truly random by showing that the limiting 
variance of the empirical mean is strictly positive. 

3.2.12 Proof of Proposition [IT12I Part 2: The random initial growth constant 

In the previous sections we have studied the asymptotics of the dual process in terms of the Crump- 
Mode- Jagers branching process and the pair (m, U) following a certain nonlinear equation. In this 
section we use these results to establish that the limiting empirical distribution of types is given 
by the McKean-Vlasov dynamics with random initial condition at time —00. Recall that in order 
to describe emergence we assume that initially only type 1 is present and we wish to determine 
the distribution of X2{t) := limyv-i-oo X2 (Tn + t) and its behaviour as t — > —00 subsequently. 

We have so far established that Sj^^'"(t, 2) converges to £t(2) which is a solution to the McKean- 
Vlasov dynamics and that 

(3.601) [ xCti2){dx) ^ a,s t ^ -00. 
Jo 

Therefore by Proposition [TT3] (see proof in Section [3221) ; Proposition 1 1 .71 and p.85p the following 
limits exist in distribution and satisfy 

(3.602) lim e"l*l / xCt{dx) ^ lim lim e"l*lx^(i^^ + 1) W, 
t— >— 00 t— >— 00 w— ^00 a 

so that Ct is a random shift of C* (cf. (|1.3ip ). namely, C* iog*w and 



(3.603) C[*W] = lim /:[e"l*l / xCtidx)]. 

We next consider the first and second moments of *W. 



(3.599) C 



1 

N 



TN+t 



^ n 



du 
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Proposition 3.18 (Moments of *W) 

The first and second moments o/*yV satisfy: 

TTlh ^ 

(3.604) E[*W]^m*E[W], with m* = — , b^(l + -), 



a 

2Qt^ 



(3.605) Var / xCt{2){dx) =0(6^"*), 
and 

(3.606) Var[*W]^m*'^[E[W\f. □ 

Proof of Proposition [37l8l Recall the equation p.578p from Subsubsection [372 .101 and note that 
this approximates (u(t))tgR for t — >■ — oo up to terms 0(e'^"*): 

(3.607) / -{a{s) + -i{s))u{s)ds = -( ^ ^ V e"* - ^^W^e^'^'] + 0(6^"*), k* > 0. 
J-oo c c \ a 2a J 

This follows from Proposition 13.101 - see Lemma [3. 151 for the proof that k* > 0. 
Therefore 



(3.608) exp^-^y {a{s) + -f{s))u{s)ds^ 



OO 

: exp ( -m* ( We''' - ^W^'e^"* ) + ©(e^"*) ) , 



where m* = 

An immediate consequence of this relation is that (recall (|3.598p for a definition): 
(3.609) mi,i(t) = 1 - m*E[W]e"' + 0(6^"*), for t -oo 
and therefore 



(3.610) lim e"l*li;[/ xCt{2){dx)] ^ m* E[W]. 



Now consider the second moment of the type two mass at time t which we denote m^'^\t). Here 
we consider the dual process with one particle at each of two distinct sites. The corresponding 
growing clouds have independent random growth constants Wi and W2. We use the formula 

(3.611) m^^\t) := E[ J xCti2){dx)f = E[j xCt{l){dx)f 

-2E[ J xCt{l){dx)] + 1 = mi,2(t) - 2mi^i{t) + 1. 

In order to calculate TOi,2(i), wi^i(i) we expand (|3.608p to terms of order o(e^"*) for W = 
Wi + W2 respectively Wi where Wi,W2 are independent copies of W, to obtain 



(3.612) 



exp ( -m* ( bWe"' - —W^e^"' 



0(e-^"') 



1 - m* bWe 



2a 
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Therefore 

(3.613) mi2(t) =l-m*bE[Wi+W2]e°'^ + m*^E[Wi + W2]^e^°'* 

+ ^im*bfE[Wi + W^2]^e2"* + 0(e3"*). 
Combining this with p.609p and p.Olip . we see that: 

(3.614) m''^\t) ^ m*—E\Wi]E\W2]e'^"* 

2a 

+ ^im*bfE[Wi]E[W2]e'^"' + 0(6^"*). 

Note that the coefficients of the higher order terms involve higher moments of W and cancel 
provided that the latter are finite. Hence to justify the cancellation it is necessary to verify that 
the higher moments of W arc finite. However this was established in Lcmma l3.4l in Subsubsection 
13X5] 

Inserting (jX^ and ([53Ti)) in Var[x2it)] = m^{t) - {E[x2{t)f we get: 

(3.615) Ya.r[x2{t)] - m* '^b{E\W]f e^'^' + 0(6^"*). 

2a 

If we can show that 

(3.616) E[{x2{t)f]^0{e^"'), 

we get via normalizing by e"* and letting t — > — oo indeed: 

(3.617) Var[*W] = m*'^b{E\W]f. 

2a 

Therefore we get ()3.606p once we have the bound on the supremum over time of the third moment 
of X2{t), which follows from Lemma [3.321 in Subsection 13.51 

Here we can use the fact that the collision-free regime gives a stochastic upper bound of the 
finite N system and then that in fact according to p.617p we can bound the third moment of the 
scaled variable in t. 

Remark 39 The randomness expressed in *W arises as a result of the fact that the mutation 
jump occurs in this time scale at an exponential waiting time (int) together with the nonlinearity 
in {3.24^ , {3.249^ . The random variable W governing the initial growth of the dual population 
(which describes the limiting behaviour of the CM J process) arise from the initial birth events in 
the dual process and has its main influence due to the nonlinearity of the evolution in t. However 
W will enter only through its mean in the law 0/ *W due to a law of large number effects. 

The moments o/*yV are determined in terms of the random variable W arising from the growth 
of Kf in the dual process. However as we have seen in the second moment calculation above the 
determination of the coefficients of e*"'"* depend on an analysis of an asymptotic expansion of the 
nonlinear system and in particular u{t) in orders up to k. Although we do not attempt to carry 
this out here, we show that with respect to Laplace transform order the law of *}V lies between the 
case in which the collisions are suppressed (deterministic case) and a modified system in which the 
the correction involving J {a{s) — a)ds is suppressed, that is, replacing a{s) by a. This illustrates 
the role of the collisions (nonlinearity) in producing the randomness in *yV and allows us to obtain 
bounds for the expected time to reach a small level e (cf. [1]). This we pursue further in Subsection 
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3.2.13 Completion of the Proof of Proposition [TTT^ 

We now collect all the pieces needed to prove all the assertions of the proposition, we proceed 
stepwise. 

Step 1 Completion of the Proof of Provosition ll.lW a) 

This was proved in Subsubsection 13.2.111 which heavily used 13.2.101 

Step 2 Completion of the proof of the Proposition \1.12\f b ) 

In order to verify that the limiting marginal random probability measure Ct{j) has for j = 1,2 
really a nontrivial distribution and is not just deterministic, it suffices to show that 

(3.618) E{[ J xCt{l)(dx)]^) > {E{[ J xCt{l){dx)])f. 

The quantity J xCt{l){dx) arises as the limit in distribution of if (i), the empirical mean mass of 
type 1. Note that the empirical mean mass process {x^{t);£ = 1,2)00 is a random process which 
is not Markov! To sec this look at the second moment of mean mass, it involves the covariance 
between two sites! More generally the dual representation of the kth moment of the empirical 
mean is given by starting k particles at k distinct sites. 

Note that no coalescence occurs until time 0(log A^) and then the two clouds descending from 
the two different initial dual particles interact nontrivially. Then it is easy to conclude that the 
limiting variance of X2 {Tn + i) as TV -> 00 is not zero using p.584[) . 

Step 3 Completion of the proof of Proposition (c). 

The relations ((L88)) . (fTMl follow from the dual representation (|3.580p of j^x^Ct{2)dx from 
the fact that i'k,i{t^-) =4> t — > —00 and i'k^i{t,-) =^ 5oo as t — > 00. This follows from 

combining p.318p and p.320|) and then using (|3.316p to get the first claim and using u^-'^{t) — > 00 
as t — > 00 to get the second claim. For the proof of (|1.91|) we will use the general fact that 

(3.619) P{Ct{l){{l}) = 1) = lim mk,i{t). 

The result then follows from Lemma [3. 171 equation (|3.582p . 
Step 4 Completion of the proof of Proposition \1.12\ (e) 

The convergence of the empirical measure processes to a random solution of the McKean-Vlasov 
equation follows from Proposition 13.81 together with Proposition 1 1.1 21 (b). 

Step 5 Proof of Provosition ll.lW f ) 

The claims (fTM]) - (fTMl was proved in Subsubsection 13.2.121 in p.602p and (|3.603p . The 
assertion (jl.95p follows from Proposition [3T8l 
Step 6 Proof of Proposition (g) 

For the assertion p. 961) we refer to a calculation we do later, see (|3.888p . 
All these steps complete the proof of Proposition II. 121 

Remark 40 The same approach can he carried out on fJ^ for fixed L or 11^ . Namely the dual can 
be viewed as follows. Start with one particle, then the particle system (factors) can be viewed as 
a stochastic Fisher-KPP equation (at least until the time of the first mutation). In particular in 
we expect linear growth according to a travelling wave solution (see \CDf ) and not exponential 
growth. In this case emergence occurs in time 0{-\/N). 
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3.3 Droplet formation: Proofs of Proposition [1.9111.111 

In this section we assume that c > 0, m > 0, s > 0, and examine the proeess in which rare mutants 
first appear in a finite time liorizon in the population at tlie microscopic level, that is only at some 
rare sites they appear at a substantial level, then later on after large times these rare sites develop 
into growing mutant droplets, that is, a growing collection of sites occupied by the mutant type 
but still being of a total size o{N). Then finally in even much larger times this leads to emergence 
at the macroscopic level once we have 0{N) sites in the droplet. 

We show that in the microscopic growth regime of the droplet the type-2 mass is described by 
a population growth process with Malthusian parameter a and random factor W*. In the next 
Section [231 we use this structure to investigate the relation between *W and W*. Recall that *W 
arose in the context of emergence and fixation by considering time log + t and letting first 

— > oo and then i — > — oo whereas W* arises from the droplet growth at times t and by letting 
first iV — >■ oo and then t — > oo. 

To carry out the analysis for the droplet growth we consider the following four time regimes for 
our population model: 

(3.620) [0, To), where < To < oo, 

(3.621) [To, Tat), where Tn ^ oo, as N ^ oo and Tjv = o(logiV), 

(3.622) [Tn, ^ +t], where < (5 < 1, t G M, 

a 

S log N log N 

(3.623) [ ^ , + t], where < S < I, teR. 

a a 

The first two time regimes are needed in proving the Propositions 1 1 . 9ll 1 . 1 1 [ the two remaining ones 
arc necessary to prepare the stage for Subsection 13.41 where we shall relate *W and W*. 

Outline of Subsection 13.31 

We now give a description of the evolution through the stages corresponding to the time intervals 
given in ^M^-^M^- 

The first step is to examine the sparse set of sites at which the mutant population is of order 
0(1) in a fixed finite time interval and then to describe this in terms of a Poisson approximation 
in the limit TV — > oo. This happens in Subsubsection [5 . 3 . 1 1 and proves Proposition II . lOl 

In Subsubsection 13.3.21 we analyse the consequences for the longtime properties and prove 
Proposition 11.111 and in Subsubsection [373.31 the Proposition 11.91 

Subsequently in 13.3.41 we recall some related explicit calculations and in 13.3.51 and 13.3.61 we 
continue with the time intervals p.621[) - p.623|) to exhibit the law and properties of W*. 

3.3.1 Mutant droplet formation at finite time horizon 

Recall the definition of (],^'"^)t>o in (|1.13p . Furthermore we recall the abbreviation: 

N 

(3.624) x^{t) = J2x^U,t) 

for the total mass of type 2 in the whole population of sites. We have to prove the convergence 
of (jf^'™)t>o as TV — > oo to (II™)t>o and then we have to derive the properties of this limit as 
t — > oo, which we had stated as three propositions which we now prove successively, but not in 
order, we close with Proposition 11.91 
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Proof of Proposition 11.101 

The strategy to prove the convergence in distribution of (2^'^)t>o as — > oo is to proceed in 
steps as fohows. We first consider only the contributions of mutation at a typical site where we 
have in reality two type of rare events, (1) the immigration of mass of type two from other sites and 
(2) the building up of this mass via rare mutation at this site. Indeed if we isolate the two effects 
and first suppress the immigration of type-2 mass, this simplification leads to iV-independent sites. 
We therefore first look at one site, then at an independent collection of TV sites and then finally 
build in the effect of immigration (destroying the independence) from other sites. In other words 
we work with the simplification in Step 1 and Step 2 and return in Step 3 to the original model. 
In Step 1 and Step 2 each the main point is condensed in a Lemma. 

Step 1 (Palm distribution of a single site dynamic) 

Recall Lemma [TT^ and the definition (|1.36p of the single site excursion measure Q for the process 
without mutation. We must now consider the analogue for the process with mutation. Consider 
the single site dynamics (with only emigration but no immigration), which is given (misusing 
notation) : 

-cx^{l,t)dt + sx^{l,t)x^{l,t)dt+'^{l~x^{l,t))dt 

+ ^d-{l~x^{l,t))ix^il,t)dw2ii,t), 
a 

N' 

We use size-biasing to focus on the set of sites at which mutant mass appears in a finite time 
interval and use the excursion law of the process without mutation (with law P) to express the 
excursions of the process with mutation (with law P). We prove first as key tool the following. 
Let 

(3.626) Qe X(C(M+,[0,1])) 

be defined as the excursion law of the process in (|1.33|) (which is (|3.625p with to = which then 
does not depend on N). Furthermore let 

(3.627) P^ = C[{xl{t))t>o] C[{xl{l,t))t>a] and a;^ solves (IXB^ with x^(0) = ^ = 0' ^ = ^■ 
Furthermore let 

(3.628) W^ex G C([0, oo), R+), w(t) = for i < Cb and t > Cd, ?«(i) > 

on the interval Cb < t < Cd for some Cb < Cd G (0, co)}- 

Note that then (i, and Q are unique functions of the element w G Wox- We shall use these 
functionals 

(3.629) a,Cd: Wex^(0,oo) 
below. 

Next observe that every continuous path starting at and having an unbounded set of zeros 
we can write uniquely as a sum of elements of Wox with disjoint intervals of positivity and with 
every point x not a zero of the path we can associate a unique excursion between 



(3.625) dx^{l,t) = 
x^(l,0) - 



(3.630) Cband^. 
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Lemma 3.19 (Single site excursion law and Palm distribution) 

Consider the evolution of i3. 625]) and let a ~ 0, d > 0, c > 0, to > 0- Then the following 
properties hold: 

(a) Fix to > and K <E N. For k = 1, ■ ■ ■ , K set Ik := X2{-^to). Let ^^(03) denote the event 
that the continuing trajectory satisfies, recall US. 628\) and the sequel, CS ~ 'E ^3 with x = -^io- 
Furthermore consider the same quantities for processes X2^k o.'^^d denote then by I^., respectively A^. 

Replace in \3. 625]) the mutation term by efk and f^ is the indicator of [-^^^o, "^'^o] ■ These 
processes are called x\ k ■ 

Then the Ik are identically distributed random variables with means j^to and variance asymp- 
totically of the form as K ^ 00. 

Then the probability of more than one such excursion is 

K K 



(3.631) lim l(^fe) > 1| E l(^fc) > 1) = 0- 



k=l k=l 

This remains true for the Ik and Ak . 

(b) The following family of measures Q((ai, 02), 03, •) and a measure Q on Wox exists. Choose 
ai, 02 e [0, 00), oi < a2, 03 < and A G B{C{[0, 00), R+)). Then 

(3.632) Q((ai,a2),a3, A)) =: Q{Cb{w) G (01,03), Cd - Cb > 03, w G A) 

^ P'i^x G (ai,6i)ff(w) G (ai,a3), Q ~ Cb > 0-3, w € A) 

e^O £ 

The measures Q on Wex can be represented as (iQ as in 113.626]) ). 



(3.633) (Q((ai,a3), as, ^) = / Q{C > a3,w{- - s)) e A)ds. 

(c) Now consider i3. 625]) which has mutation rate £ — and put a ~ 0. Then for to > 

(3.634) \im N ■ E[x^{l,to)]> 0. 



N- 



(d) Let fi^ be as defined in {L14^ , that is, the Palm distribution o/a;^(l,i) in i3. 625]) with 
a = 0. The following limit exists: 



X /p Q(C6 e ds, Cd > t, w{t) G dx)ds 



(3.635) iiridx)^ lim Jl'^ (dx) ^ 

J /o xQiCb e ds, Cd > t, w{t) G dx)ds 


Hence the Palm distribution fi^ is the law of a random variable of order 0(1) which is also 
asymptotically non- degenerate as TV —>■ 00. 

Furthermore as a consequence of \3. 634^ and \3. 635]) the first moment of the Palm distribution 
of X2 (l,io) has mean satisfying: 



(3.636) lim E[x^{l,to)] = lim ^^\f}l;'f'j € (0,oo). □ 

W-i-oo TV-foo E[[X2 (l,to))\ 



Proof of Lemma 13.191 

We prove separately the parts a) , b) and then c) , d) of the Lemma. 
(a) and (b). 

To prove (a) and (b) we begin by approximating the single site process {x2{t) '■ Q < t < to} 
with law (which satisfies p.625p with ^ = e) by a sequence of processes, where we replace 
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the mutation term (induced by the rare mutation) in different time intervals by mutation terms 
at a grid of discrete time points getting finer and finer. In order to keep track of the mutations at 
different times we split the type 1 into different types. Namely let K E N and consider a K + 1 
type Wright-Fisher diffusion with law 

(3.637) P"'^, 

which is starting with type having fitness and the other types having fitness 1 > (selection at 
rate s) and with mutation from type to type k at rate e only during the interval (-^^-^^j 

We first consider a simpler process, which is built up from if-independent processes. Namely 
consider independent processes j, representing the masses of types k = 1, . . . , K in which in each 
time interval we suppress mutation of all types except some k. Then j.(-) is a solution of p.625p 
with mutation term el^k^^.^ jLt^^i )- Let /fc(£) := x| ^,(1, -^to)- Let ^^(03) denote the event that 
the continuing trajectory satisfies Cd ~ > 0,3. Note that the /fe(e) arc identically distributed 

random variables with means j^to and variance asymptotically of the form + o{-^). 
Note that as £ — > 0. 

(3.638) ip/,(,)(.4fc(a3)) - ^Q(C > a^) + o{Ik{e)). 



Therefore 

(3.639) e {a„a2))jPi,ie){AM) 



k=l 

= ^ E ^ «2))^^^Q(C > «3) + oie) as e ^ 0. 

K ^ — ' K e 

k=l 

Moreover by independence, Prob{Ak O Ak') = P/^(e)(Afe)P/^,(e)(Afc') = 0{e^), k ^ k' , and 
therefore the probability of more than one such excursion in the limit t — > is equal to zero. 



(3-640) lim ^ -Pi^i^M>^) ■ Pi.ieM,,) = 0. 

k,k' 



Then letting K ^ 00 we get using the continuity of i -> Q(w(t) £ •) that the following limit 
exists 



(3.641) 



b G (ai,a2),Cd - C& > a3,w{tQ) > x) 

^ E ^ («i,«2)) I n {x{to) > x}) 



lim lim ] 

e^O K—yoo 



lim 



Ei(— 



KIkie) 



kto. 



> as, w{t - —) > x) + o(e) 



> aa, w(io — s) > x)ds. 



We claim that the analogous result is valid for the multitype model, that is, letting A^, denote 
the presence of an excursion of type k of length greater that 03 and having at time Iq mass bigger 
than x, starting in (01,02) 



K 2 
(3.642) limP-'^(Q A*)= / 



> 03, w{tQ — s) > x)ds. 
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To verify that this remains true with dependence between the types as they arise in a multitype 
Fisher- Wright diffusion, we return to the law P'^'^ . The resuh then foUows from (|3.64ip by noting 
that 

(3.643) P' ^'iAk n A,,) < Pj,^,^iAk) ■ Pi,,ie){Ak'). 

The latter holds, since 
(c) and (d) 

The proof of the convergence of /i^ as N oo proceeds by showing that for every fc e N we 
have: 

(3.644) \un N ■ E{[x^ {i,t)f ) = m\im I x'' [ Q{Cb e ds,Q - Cb > S,w{- - s) e dx) 

and that the limiting variance is positive. 

We first use the dual process to the process defined in equation p.625p to compute the first and 
second moments. Since we discuss the limit TV — > oo over a finite time horizon with finitely many 
initial particles, we get in the limit no immigration term, but only emigration at the migration 
rate. 

We warn the reader at this point that we use the same notation for the dual process of this 
single site diffusion as for the one of our interacting system. Let Ili"'-' denote the number of factors 
in ()3.7|) at time t starting with j factors at time where Hp'' is the birth and death process with 
birth and death rates 

(3.645) sk and ck + —k{k — 1) respectively. 

If we consider finite N we have additional immigration at rate N~^{Il[^^''^ — fc)+ if at the site 
considered we have k particles. This means that as A*" — >■ oo and over a finite time horizon with 
probability tending to 1 at rate 0{N~^) no immigration jumps occur. 
Then for fixed t we have that: 



(3.646) E[x^ {i, t)] = l- E[x'{ [i, t% 



Elx^ht)] - i?[exp(-^/>l^)'^dH)] 

t , , 



(3.647) 



£:[exp(-f jWu'du)], as TV cx) 



1 - f n^u^du], as iV ^ oo 
1-^, a&N^oo, 



where ci = ci(t) e (0,oo) is constant in N . Hence 

(3.648) N ■ E[x^ {i,t)] ^ mci ^ m [ E[n^^'^]ds. 

For fixed t, as iV — >■ c>o we calculate the second moment: 
E[{x^{t,m = i?[exp(~^/>l^)^^d^)] 

(3.649) ^ E[cM~W I^n^du)] 





-I mc2{t) 

N ' 
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for some C2 — C2{t) G (0,oo). Hence 

E[x^{t,t)^] =l-2E[x^{i,t)]+E[xi{i,t)^] 
(3.650) ~ 1 - 2E[exp{-^ /J ni^'>du)] + i?[exp(-f J^* U^^Uu)] 



Therefore 



(t) = m[2 f E[Ui^^]du- f E[Ui''>]du]. 
Jo Jq 



(3.651) lim N ■ E\x^(i, tY] = 277iCi (t) - mc2 
Therefore the l.h.s. of p.636p equals 

(3.652) (?£iW^. 

This Umit is larger than since (for every t) 

(3.653) C2(t)<2ci(t) 

holds because of the positive (uniformly in N) probability of coalescence between the two initial 
particles of the two populations before a migration step of either one. This implies that every limit 
point of the sequence of laws of a;^ {t) (solving (|3.625p ) has positive variance. 
Similarly we introduce 



z 

(3.654) Ck{t) = jli^Uu 



and we can establish writing 

(3.655) E[{x^{^,t))'^] = ^ (%^iy E[{x^ {z,tm 

j=o ^-^^ 

and then using 

t 

m 



(3.656) E[{x^{i,t)y] = E[exp{-— / Ui^Uv 

the convergence of all fc-th moments as for fc = 1, 2. This then gives the weak convergence of the 
Palm distribution 'jl^ as — >■ oo. 

Finally we have to verify the relation (|3.635p . that is, the limiting Palm distribution is repre- 
sented in terms of the excursion measure. 

Note that for every fc e N we know that: 

(3.657) / x''Qt{dx) 
Jo 

x''Q{x2{t - s) e dx)ds ^ [ [ x''{\im-P^X2{t - s) e dx)ds 



JQ JO JO 

t 



e^O £ 



[ lim -E^[{x2{t - s))'']ds. 
Jo ^^0^ 
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Using the duality wc calculate then (with E^ denoting the expectation with respect to Pg 
(3.658) lim -E,[x2iu)] = lim -E[l - (1 - e)"- *] = S[n,W)]. 



(3.659) lim -E^ \x2 (u f] = hm -(l-2E, \xi (u)] + E. \xi (u?] 
£->-o e e->o e \ 

= lim i (l - 2E[1 - (1 - £)""'] + E[l - (1 - e)"""]) 

= 2£;[nW]-i?[n(2)]. 

Substituting this (with fc = 2) in p.657p and comparing with p.648p and p.649p we verify p.644p 
for fc = 1, 2 and (|3.636p . Similar calculations can be used to verify the claim for all fc G N. ■ 

Step 2 (Compound Poisson limit) 

Consider a collection of processes where each component of the system is still as in (|3.625p in 
which we ignore the effects of immigration into sites and only emigration is still accounted for, 
namely, 

(3.660) dx2 {i,t) = -CX2 {i,t)dt + s x^ (i,t)x2 {i,t)dt + ^{l ~ x^ {i,t))dt 



+ \ld-{l- x^{i,t))x^{i,t)dw2{i,t), 



Note that this is a system of N independent diffusion processes. Furthermore we calculate for the 
total mass process 

(3.661) dx^it) < ((-c + s)x^{t) + ^)dt + dMt, 

with a martingale {Mt)t>o, where 

(3.662) {M)t<d-x^{t). 

Therefore we can bound by a submartingale and if we start with only type 1, i.e. if a ~ 0, then 
by submartingale inequalities we get: 

„r , AT , M 1 constant 

(3.663) P[sup\x^{i,t)\> e]< 



t<T 



N 



We return now to the study of the independent collection of diffusions we introduced in the 
beginning of this step. Consider the atomic measure- valued process 

(3.664) 0^nt>o, 

which is defined as the analogue of {2^'™)t>o but with only excursions arising from mutation, that 
is ignoring immigration at each site (emigration is still accounted for) which was defined in (|3.654p . 
Similarly let 

(3.665) {x2ii,t),ieN} 

be the corresponding collection of independent diffusions. Then we carry out the limit N — > oo. 
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Lemma 3.20 (Droplet growth in absence of immigration) 

At time to, asymptotically as N ^ oo we have the following three properties for the dynamics 
introduced above in ^3. 664\ \3.665]) . 

(a) There is a Poisson number of sites j at which a;^ (j, io) > e where the parameter is 

(3.666) m / Q{iu{to - s) > e)ds. 

Jo 

(b) At time to, in the limit N — >■ oo, there are a countable number of sites contributing to a 
total mutant mass of order 0(1). 

(c) For each t > the states oflf'"'' converge in the sense of the weak atomic topology to 

(3.667) 5ron[0,l], 

with atomic random measure specified by the two requirements 

(3.668) atom locations i.i.d. uniform on [0,1] 

and atom masses given by a Poisson random measure on [0, 1] with intensity measure 

(3.669) mvt{dx)^ where vt{dx) = I Q{w : w{t ~ s) ^ dx)ds. 

Jo 

The Palm measure Vt of Vt satisfies (recall flf^ from Lemma \3.19\ (d)): 

(3.670) dt =^^. □ 

Proof of Lemma 13.201 fa) It follows from Lemma [3. 191 (a) that 

(3.671) lim N-P[x^{i,t)>e]^mQt{{e,l])- 

Therefore in the collection of N sites it follows that asymptotically as — ?> cxj the number of 
sites j with 2?^ (j, to) > e (recall that in this step we suppress immigration and we are considering 
only mass originating at this site) is Poisson with parameter m(Qt((e, 1]). 

(b) Since (^(((O, oo)) — oo and t't(((5, oo)) < oo for 5 > 0, there are countably many atoms in the 
limit as — > oo. 

(c) Consider the sequence obtained by size-ordering the atom sizes in Uf^'™. Note that the 
limiting point process on [e, 1] is given by a Poisson number with i.i.d. sizes and the distribution 

of the atoms is given by tt- It follows that the order statistics also converge and therefore 

_ , \ Qt([£,l]) ^ 

the joint distribution of the largest k atoms converge as iV — > oo. 

To verify that the limit is pure atomic we note that the expected mass of the union of sites 
of size smaller than e converges to as e — >■ uniformly in N by the explicit calculations in the 
next section (cf p.706p V The convergence of Ilf^'™ in the weak atomic topology then follows by 
Lemma 11.61 

Step 3 ( Completion of proof of convergence ) 

Here we have to incorporate the migration of mass between sites in particular the immigration 
to a site from all the other sites. Note that the immigration rate at fixed site is as the rare mutation 
of order A''"-'^ as long as the total mass is 0(1) as N oo and therefore we get here indeed an efi'ect 
comparable to the one we found in Step 1 and 2. Wc can show directly that the first two moments 
of the total mass x^(i) = 2^''™{[0, 1]) converge to the first two moments of X2{t) = 3™(i)([0, 1]). 
These moment calculations are carried out later on in Subsection [331 ii^ particular the first moment 
result is given by p.86ip and the second moment is given by (|3.918p . This gives the tightness of 
the marginal distributions. 
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Consider the sequence of time grids £ — 0, 1,2, . . .} with index k and with width 2 

Then the migration of mass between time ^ and and the mutation in this interval wiU be 
replaced by immigration from other sites and by rare mutation at the end of the interval. We 
then define using this idea an approximate evolution in discrete time by an approximate recursion 
scheme of atomic measures on [0, 1]. 

Let Tt{x) denote the evolution of atomic measures, where each atom follows the single site 
Fisher- Wright diffusion with emigration and the initial atomic measure is x. Furthermore let Y^''' 
be an atomic measure with a countable set of new atoms such that the atoms in Y^^'*^ are produced 
as above in Step 1 if we observe the process at time 2"*^ but now the intensity of production of a 
new atom is instead of just m in the previous steps now given by: 

(3.672) m + cx^{^), 

where is given in p.624p . Then define: 

(3.673) xf(^) = T_^Xf(^) + yf^^ fcGN, 

which defines, for each N and a fixed parameter fc G N, a piecewise constant atomic measure- valued 
process which we denote by 

(3.674) O^'^nt^o- 

We will now first investigate the weak convergence of this process, in the parameters fc, N tending 
to infinity. First we focus on the weak convergence w.r.t. the topology of the weak convergence of 
finite measures on the state space and then we pass to the (stronger) one w.r.t. the weak atomic 
topology on the state space. 

Consider first TV -> oo. For given k and t, as — >■ cx), the random variable jf^''^'™, converges 
weakly even in Aia{[0, 1]) to 2"^'^'"^ as proven in Lemma [3.201 (c). We can then obtain the con- 
vergence of the finite dimensional distributions of (2f ' '™)t>o to those of (H^' ''")t>o- Tightness 
and convergence is obtained as we shall argue at the end of this proof so that we have as ^ oo 
weak convergence of processes to 

(3.675) OT''nt>o- 

Next we let fc — oo and consider the weak topology on the state space. Note first that the 
random variable in (|3.675|) is by construction stochastically increasing in fc, since passing from k 
to A: -|- 1 we pick up additional contributions from newly formed atoms. We can then show that 
as A: — >■ oo the 2^'^^'"^ converge in distribution to Similarly we can take the limit fc — >■ oo in 

2N,k,m -|Af.m^ 

We can then use the triangle inequality for the Prohorov metric p(!lf '™,!]™) to get our result. 
Namely from the weak convergence of J^'*^'™ to as TV — >■ oo, the convergence as fc — >■ 

oo of 2f. ' to 2t as well as the convergence as fc — > oo of H^' '™ to J™. Using then the 
Markov property and a standard argument we can verify the convergence of the finite dimensional 
distributions of !lf to the finite dimensional distributions of J™. 

It now remains to check for (!l^'™)t>o as ^ oo the tightness condition of the laws and 
then convergence actually holds also in C([0, oo), {Mai[0, l]),Pa))- We first prove tightness with 
respect to the weak topology on the state space, i.e. we first prove tightness in the path space 
C([0,oo),Al/([0,l])). 

We write the state of the process as 

N 

(3.676) jr''" = ^af(i)<S,«(,). 
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Then it suffices to show that the laws of 

(3.677) {f]/(afW,xf(i)), NeN} 

are tight for a bounded continuous function / on [0,1] x [0,1]. But the {t) do not change in 
time and the a^{t) are seminiartingales with bounded characteristics and are therefore tight by 
the JofFe-Metivier criterion. 

To get to the convergence in path space based on the weak atomic topology on the state space of 
the process, we next note that since the joint distributions of the ordered atom sizes at fixed times 
converge, the locations are constant in time (as long as they are charged), we have convergence of 
the finite dimensional distributions in the weak atomic topology. The verification of the condition 
for tightness in C {[0,oo), Ma {[0,1])) ([^63)) follows as in the proof of Theorem 3.2 in [EK4] . 

3.3.2 The long-term behaviour of limiting droplet dynamics 
(Proof of Proposition [TTTTI) 

We now investigate the behaviour of the limiting (N — > oo) droplet dynamic, where again the dual 
process is the key tool, now the dual process of McKcan-Vlasov dynamics and certain subcritical 
Fisher- Wright diffusions. 
Proof of Proposition 11.111 

We prove separately the three parts of the proposition. 

a) Let (3™)i>o be defined as in Propositions 11.51 We begin by deriving a formula for the first 
moment 

(3.678) mit) ^ E[2T{[0,1])], 

given !I™(-) with 3™([0, 1]) < oo, which is accessible to an asymptotic analysis. 

First we introduce a key ingredient in the formula for m(-) and we obtain its relation to the 
exponential growth rates a* respectively a. 

Let 

(3.679) f{t) = / w{t)Q{dw) = lim -E,[x2it)], 

J Wo "^^0 £ 

where X2(-) denote the solution of (|1.33p and Eg refer to the initial state e. We now obtain an 
expression for p.679p . 

To compute the first moment of X2(t) we use the dual representation for the moments of the 
SDE (|1.33p . The dual process to be used then is {r]t,T^)t>o with one initial factor li(-). The 
dual particle process {rit)t>o then is effectively a birth and death process denoted (£'o(^))t>o on 
No, with a dynamic given by the evolution rule that the process can jump up or down by 1 (birth 
or death) and the rates in state k arc given by: 

(3.680) birth rate sk, death rate ck + -^k{k — 1) 
and 

(3.681) initial state Dq = 1. 

Then the dual expression is given by 



(3.682) E,[x2{t)] = 1 - E[{1 ~ e)''»^% 
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Therefore 

(3.683) f(t) = / w(m{dw) = lim -EJx2(t)] = lim — ^ = E\DM)] 

Jwn ^-^0 £ £ 



I Wo 

Now let a* be chosen so that 

/•OO pOG 

(3.684) c e-°''''f{r)dr = c e~°'''' E[Da{r)]dr ^ 1. 

Jo Jq 

RecaUing that by (|3.138p . (recall the death rate of -Do(0 is zero if k — 1) 

(3.685) / e-''''-E[Do{r)]dr^ / e-''''^E[Do{r)lD„ir)>2]dr, 
Jo Jo 

it follows that a* = a with a as defined in p.l35p . (|3.137p . 

Now we are ready to write down the final equation for to(-). Consider 

(3.686) Xit) ^ CEy^{t)Sa^)t>0, 

where yi{t), ^yi{0) < oo, are those realizations of the (independent) Fisher- Wright diffusions 
(|1.33p which represent the atoms that were present at time without immigration. Furthermore 
assume that J^Dii.^) < oo- Then we consider the process (2™)t>o defined in Proposition 11.51 with 

i 

2* therein as given in p.686|) . 

By taking expectations in the stochastic equation defining H™, namely (jl.46p . we get if we define 

(3.687) g{t) = E[J2y,{t)], 

i 

the renewal equation 

(3.688) m{t) = E[1T{[0, 1])] = i?E y.W] + /o E[{m + c2T{[0, 1])) w{t - r)Q{dw)]dr 

i 

= g{t) + m f{r)dr + cra{r)f{t — r)dr. 

The last step is now to analyse the growth behaviour of {m{t))t>o using renewal theory. Mul- 
tiplying through equation p.688|) by e~" * we get an equation for (e^" *m(t))t>o in terms of 
(e-"**/(<)) and (e-"**5(t)) as follows: 

(3.689) e-"'*m(0 = e-"'*[((?(i) -f m / f{r)dr)]+cf (e-"*''m(r))(e-"*(*-'')/(i - 0)^^- 



This equation in e " */(i) and e " *g{t) has the form of a renewal equation. 
We want to apply now a renewal theorem to this equation. Define R by 

/•oo 

(3.690) R = c re-°''''f{r)dr. 
Jo 

Since f{t) and g{t) are continuous and converge to exponentially fast, it can be verified that a{u) = 
g-Q u ^g(^u^ -|_ /(r)(ir] is directly Riemann integrable. Therefore by the renewal theorem 
([KT], Theorem 5.1) we obtain from p.689p that: 



(3.691) lim e-"**TO(t) = ^ , 

t— >-00 it Jq 



g{u) + m / f{r)dr 



du G (0, oo). 
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Hence e " converges as t — > oo to the r.h.s. of (|3.69ip which concludes the proof of part 

(a) of the proposition. 

(b) Let 

(3.692) J2(i)=3r([0,l]). 

The convergence in distribution of e~°'*X2{t) as t oo follows since by a) the laws of {e~"*X2(t), t > 
0} form a tight family and we have to exclude only that several limit points exist. We know that 
for any r > 

(3.693) lim i;([e-"'*+")j2(t + r) - e-'^'Mt)^) = 0, 

t—^oo 

which is proved in Proposition 13 . 27l (Recall here that over a finite time horizon [0, r] the mutation 
can be ignored and hence the proposition applies). We have to strengthen this statement p.693p 
to 

(3.694) lim (sup£;[e~"(*+^)S2(t + r) - e-°''x2{t)f]) = 0. 

This extension is provided in Corollarv l3.28l of Subsubsection [57?751 where it is proved using moment 
calculations. This implies that there exists W* such that 

(3.695) X2{t) — > W* in L^. 

The non-degeneracy of the limit W* follows from (c). 

(c) This claim on the variance of W* follows from Proposition 13. 2 II in Subsubsection [373.51 

Remark 41 The proof of (c) demonstrates that the randomness arises from the early rare mutation 
events. 

3.3.3 Proof of Proposition [TT9l 

This proof builds mainly on the previous subsubsection and on calculations we shall carry out in 
Subsection 13.41 

(a) This follows from Proposition II. 101 

(b) Recall that we know already that e~"*iz;^ (t) converges in law as — > oo to a limit e~"*X2(t) 
and hence by Skorohod embedding and our moment bounds we have convergence on some space, 
furthermore e~°'*X2{t), converges in as t — > oo to a limit W* as proved in p.695p above. Here 
we shall show that 

(3.696) lim lim £;[(e-"*zf (t) - e-"*"x^(iAr))2] 

t— >oo Af— >oo 

and then we use (|1.79p to get the claim. The above relation follows via Corollarv 13.311 which is 
proven by moment calculation which requires subtle coupling arguments which we shall develop in 
Subsubsections 13.3.51 and 13.3.61 

3.3.4 Some explicit calculations 

The entrance law from for the type-2 mass can also be derived using explicit formulas for densities 
of the involved diffusions both for Fisher- Wright and branching models. We collect this in the 
following two remarks. 
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Remark 42 Consider the Fisher- Wright given by the SDE 

(3.697) dx{t) = ^[1 - x{t)] + - x{t)]dt + sx{t){l - x{t))dt + d ■ x{t){l - x{t))dw{t), 

x(0) 

where y(t) — x^ (t) is inserted as an external signal. By Kimura \Kimlf if m = c = s ^ 0, then 
there is no N -dependence and the density of the solution at time t is 

f{p,x;t) 

(3.698) = P(l - P)i{^ + l)(2j + i)Fil - i + 2, 2, p) • F{1 + 2,2, x)e'^'+^^'^*/^ 

= 6p(l - p)e-'^* + 30p(l - p)(l - 2p)(l - 2x)e-^'^* + ..., 

where F{a, j3,^]x) is the hypergeometric function. 

Consider now the neutral Fisher- Wright diffusion, s = and constant y{t) — m, c > 0, d ~ 1. 
Crow-Kimura (see \GRDf . Table 3.4, or Kimura \Kiml^ . (6.2)) obtained the density at time t 

f^{y,x,t)= a;^-i(l -a;)'"-'^-! 

where the initial value of the process is y and 

yni 711 111 

(3.700) ^ = = 2(- + c), A, = z[2(- + c) + {^ - l)]/2, 

(3.701) F,{x)^F{v + i-l,-i;v- ii;x), 

where F is the socalled hyper-geometric function that is that solution of the equation 

(3.702) a;(l - x)F" + [7 - (a + /3 + l)x\F" - a^F = 0, 
which is finite at x = 0. 

Setting y = and taking — > 00, then for every t > and x G [0,1], taking f^{x,i) = 
f^{Q, X, t), we get that there exists a function f°°{x, t) which is for every t the density of a a -finite 
measure such that: 

(3.703) Nr{x,t)^r{x,t). 

The limit defines an entrance type law. Note that f^ has the same behaviour near zero as N 00 
as the branching density below in 705\ l. 

Consider now the case s > 0. Then selection term then leads to a new law which is absolutely 
continuous w.r.t. the neutral one and the density is given by the Girsanov factor 

Is /■* , s2 rt 

(3.704) Lt = cxp - / ^yd-x{s){l - x{s))dw{t) - tth d, ' - x{s))ds 

L" Jo 2a Jo 

Therefore the selection term does not change the asymptotics since it involves a bounded Cirsanov 
density (uniform in N). 

Alternatively to the above argument we can take the Kimura solution (4-10) of the forward 
equation and take the limit as p — > 0. □ 

Remark 43 For comparison, consider the branching approximation where there is no competition 
between excursions and we obtain for initial state the Camma density 

(3.705) f^ix,t) = ^xT^-^e-^/'^", a; > 0. 

Then Nf^ converges to the density x^^e^^^'^ on M+ of a a -finite measure. Moreover 

(3.706) N ■ / xf^{x, t)dx ^ os e ^- uniformly in N. □ 

Jo 



3 PROOF OF RESULTS 



141 



3.3.5 First and second moments of the droplet growth constant W* 

Continuing the study of the droplet growth we now turn to times in [To,Tn] as given in p.62ip 
and (|3.622p . In the sequel we compute the limits as iV oo of the first two moments of the 
total type-2 mass in the population denoted 2?^ (tAr), for tjy — )■ oo, t]\[ = o{\ogN). The present 
calculation allows in particular to show the existence of W* and to determine its moments. 
As before we use the dual representation to compute moments. For example, recall that 



(3.707) E[x^{iM)] = E[exp{-^ J Hf^'ds)], 



where H^'^ is of the total number of particles in the dual cloud in {1, . . . , N} starting with one 
factor at site i at time t = 0. As developed earlier this cloud is described by a CMJ process 
and has the form 

M'jv(t)e°' 

(3.708) n^''^ 

where W^Af(0 ~^ ^{^) as — > oo and W{t) — W a.s. as t — ?> oo and the processes at the different 
sites satisfy £[(Cj^(i))f>o] C[{(j{t))t>o] as A^ — t' oo, where the (Ci(0)t>o for J € N are all 
evolving independently and are given by the birth and quadratic death process at an occupied site 
with the same dynamic but starting from one particle at the random time, when the site is first 
occupied. The dependence between the different occupied sites arises through these random times 
even in the A^ — >■ oo limits. 

If we consider second moments we have to start with two particles and then get the analog of 
p.707|) but in the representation of the r.h.s. of this formula given in p.708|) . WN{t) and have 
to be replaced accordingly and in the limit N ^ oo the W = W^'^ is replaced by W^'^. 

We have noted above that if tjv = o(logA^) then the probability that a collision between dual 
particles due to a migration step occurs up to time goes to as A^ — > oo. The new element that 
arises in this subsubsection is the fact that in the computation of the higher moments of X2 (ijv); 
these collision events must be taken into account. The reason for this is that we will compute 
moments of the mutant mass (t) in terms of the moments of (t). We see below from the 
formula (|3.716p that in the expansion of fc-th moments of the total mass powers of A^ appear, which 
forces us to consider in the expansions of these moments as well as higher order terms in A^"-'^. 
Hence we have to analyse more carefully the overlaps in the dual clouds corresponding to different 
ancestors which have probabilities which go to zero but not fast enough to be discarded. 

The key tool we work with is the coloured particle systems (WRGB) introduced in Subsubsec- 
tion [STl 

The main result of whose proof this subsubsection is devoted to is: 

Proposition 3.21 (Limiting growth constant of droplet) 
Let ijv — ^ 00 and 

(3.709) limsupT^^ < 

^ ' AT^oo logAT 2a 

(a) The family 

(3.710) {e~"*"xf (tjv) -.NeNjis tight, 
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with non- degenerate limit points W* with 
(3.711) < E[W*] <oo, < Var[W*] < oo. 

(b) Furthermore: 
(3.712) 



lim e-"'*"E[x^{tN))]^m*E[Wi], 



(3.713) lim 



7V- 



E[x^ {tN)r] ^ {m*yiEWiy + 2m*{EWiyK2, 



where a is defined as in \S.139\) . K2 > f which is given precisely in \3.745^ ) and 
(3.714) m* = ""^"^^^ with b = l + ^. 



(c) The third moment satisfies 
(3.715) limsup£;[e-3"*"(xf (iA,))3] < oo, 

so that the first and second moments ofW* are given by the formulas on the r.h.s. of ^3. 712^ and 

The proof of the main result is given in the rest of this subsubsection which has five parts, 
namely we require four tools to finally carry out the proof. 

(1) First we give a preparatory lemma [3.221 that gives asymptotic expressions for the moments 
of X2 (ijv) in terms of the moments of x^(tjv)- 

(2) In order to calculate the moments of x^(tjv) we must determine the growth of dual clouds 
starting with one or two initial factors. This is stated in Proposition 13 . 24l and is then proved. 

(3) Here we introduce a multicolour system to analyse the two dual clouds and their interaction 
starting both from one particle (at different sites). 

(4) Then we present in Lemma 13.261 the tools to obtain the asymptotics of the expectations of 
the exponentials involving the occupation times of the dual clouds. 

(5) Finally we complete the proof of the above main result, the Proposition l3.211 using the dual 
expressions at the end of the section and the results from the previous Parts 1-4. 

Part 1 (Moment formulas) 

Lemma 3.22 (k-th moment formulas) 

(a) We have the k-th moment formula for the total type-2 mass: 

/ 1\ \ 

(3.716) 



N,2, 



m,y'{tN) 



N 

J2 X2 ii,tN) 

i=l 



= 1,2,' 



where rhf'^ is the £-th moment for the total type-1 mass. 

(b) The term on the r.h.s. of {3.716^ is calculated as follows. Let q — (gi, 
be the dual starting with qi-particles at site i = 1,- ■ 



■,qj) and let 11^ ^'^ 



(3.717) /i,,, 
Then 



r,q{'^N) - E 



cxp 



N ' ° 




(3.718) mfi(t^)=^('M y: 

.7 = 1 ^ ^ qi + ...q.i=q 



91 ! 



■<1] 



□ 
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Proof of Lemma 13.221 

(a) First recall that 



N 



N 



(3.719) irfW = 5]xf(z,0, x^it) = Y,^2iht)^N^x^{t) 
and therefore 



1=1 



(3.720) 



N 



Eh 
e=i 



Vi=l 



{^ifN^-'^E 



fc = l,2,... 



Recalling the definition of fri^'^{t) the claim follows. 

(b) Moreover we can express the moments of the mass of type 1 appearing on the r.h.s. of 
p.720p by expanding according to the multiplicity of the occupation of sites in the mixed moment 
expression, namely: 



(3.721) 



N 



N 



E E[x^i^^,t)...x^i^k,t)] 



The moments on the r.h.s. above can be computed as follows. Define 



(3.722) fig,_g^.g{tN)^E 

Then 



distinct, 



qe = q- 



(3.723) mfi(t^) = E 



N 
j 



E 



qi\...qj\ 

qi+...qj=q ■> 



tJ-qi,...,qj; g(ijv)- 



Next use the duality relation to express fj,q-^_...^q.-q. Define 



(3.724) ni^--^^)'^, 

as the total number of dual particles at time In starting the particle component ?yt of the dual 
process {T^t^T^^) with 

(3.725) qi particles at distinct sites i = 1, • • • , j. 
Then write 



(3.726) tlq„...,q^,q{tN) = E 



exp 



N 



q.e.d. 

Part 2 (Growth of dual clouds) 
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Looking at the r.h.s. of formula p.716p - (|3.718p wc sec that for calculating higher moments we 
have to study the asymptotic (as iV — > cxj) behaviour of the dual system starting with more than 
one particle. Of particular importance is to estimate the overlap in the dual particle population of 
different families due to different ancestors at the starting time of the evolution since this will allow 
to control the terms of order N^'^ which are needed to compensate for the (^) term on the r.h.s. 
of (|3.716p . This is addressed in the form needed for the calculation of first and second moments 
in the following definition and proposition. 

Definition 3.23 (Dual clouds for moment calculations) 

(a) To calculate first and second moments at one time we calculate the dual particle number 
asymptotically as follows if we start with one or two particles. We choose Wa as: 

• Wa = Wi where Wi = W^'^ = W defined by ^3.143^ starting with one factor at time 

• Wa = W^''^ = Wi + W2 where Wi,W2 are independent copies of the random variable W 
arising from one factor at each of two different sites, 

• Wa = VF^'^ which is the corresponding random variable when the CM J process is started with 
the two factors l^iy <S) l^iy at time at the same site. 



Similarly we write Wa{-) for the function {Wa{t))t>o arising from e "^K^. 



Let ^ be a time-inhomogeneous Poisson process on [0, 00) with intensity measure given by 

(3.727) 

^ ' c N 

Note ^ depends on the chosen a but only through Wa. 
Let 

(3.728) g{u,s) 

( which evolves independently of the parameter N ) be the size of that green cloud of sites of age u 
produced by one collision (creating a red particle) at time s. Then g{t, s) has a law depending only 
on t ^ s, namely 

(3.729) g{t-s)^e°'^'-'^W\t^s), 

where W^{t — s) — >■ W^ as t — s 00 and W^ is defined by this relation. 

Furthermore introduce for a realization of ^ a process Q depending on the parameters N and 
whose law depends on the choice of a but only through Wa(-) and which describes the size of the 
complete green population at time t: 

(3.730) 5iWai-),N,t)^ f git,s)ads). 

Jo 

Analogously to g we define the pair 

(3.731) 5tot,0tot 

as the number of particles in the green cloud, instead of the number of sites, 
(b ) To compute the joint second moment at two different times we consider 



(3.732) Wa = Wj, + W^ 



which is the corresponding random variable when the CMJ process is started with the factor l|x} 
located at ji at time and an additional factor l{x} is added at location j2 at time s > 0. 
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(3.733) 



Let be an inhomogeneous Poisson process on [0,oo) with intensity measure 

Wi{u)W2{u)e^'"'' 



(a + 7)- 



N 



du. 



Then with 

(3.734) t^tN, 

we consider time t + s. Furthermore set (recall jj*. 734^ ) 

(3.735) Qiia){W^{-),W2{-),N,t,t + s)^ g{tN,u)^^^^^\du). 

We denote by j^^'^^i^'^^'^-'^'*) ^/jg number of occupied sites in the dual particle system starting 
with a particle at ji at time and one at time j2 at time s. □ 

Now we can state the main result on the growth of the dual clouds. 



Proposition 3.24 (Dual clouds) 
Let 

.log TV ^ 

(3.736) t = ijv, where t^ oo, tN — o( ), 

more precisely, 

(3.737) 2atN -\ogN -oo as N ^ oo. 

Then consider the growth of the dual cloud in various initial states indicated by the subscript a. 
(a) We have 

(3.738) i^f'^"' = Wait)e''' - Q{Wa{-),N,t) + £a{N,t), 



(3.739) n 



N,(a) 



a, tot 



where we have for some K2 S (0, oo) 

(3.740) lim Ne-^"'-E[Qtot{Wai-),N,tN)]=K2E[{Wa)' 
and the error term £a or Ea.tot satisfies that 

(3.741) lim £:[iVe-2"*"|f„(Af,^^^)|] = 0. 

(b) We have (as N oo, recall {3.734\) ): 



„3at 



(3.742) A',^^p^^°)'(^-^) ^ (^Wie"'^'+''> +W2e''') - 2{W,{-),W2{-),N,t,t + s) + 0{j^) 



and 



c N'^ 
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where gtotiWi{-), W2{-), N,t,t + s) satisfies the two conditions 

3totiW^^H-),N,t,t + s)^ Qtot{Wii-),N,t + s)+Qtot{W2{-),N,t) 

^ ^ ^ +0(l,2),tot(W^l(-),W^2(-),^,S,t), 

and 

(3.745) lim N ■ e-''^^*''+''>E[Q^i,2),tot{Wi{-),W2i-),N,tN,tN + s)] ^ 2k2E[Wi]E[W2]. 

Similarly such relations hold for g instead of gtot with K2 replaced by the appropriate different 
positive constant. 

(c) Now consider the case in which TV — >■ oo but t is independent of N (that is, we no longer 
take t]\[ — > oo). Then we do not replace Wi{-) by Wi and a{t),"f(t) by a, 7. Then in ^3.74^ , 
13. 74S\ I Wi ■ {a + 7) is replaced by Wi{t + s){a{t + s) + j{t + s)) and W2 ■ {a + 7) is replaced by 
M^2W(a(0+7(t))- □ 

Remark 44 Note that in the expansion {Wi + ^2)^ = Wi + 2W1W2 + W| we can associate the 
term 2W1W2 with collisions of a particle from one cloud with sites occupied by the other cloud. In 
order to asymptotically evaluate these collision terms we could keep track of the white, red, green, 
blue particles from the two sites by labelling them, for example, by 

(3.746) (Ty_,i?_,G_,B_), {W+, R+,G+, B+), 

respectively, and then considering the collisions of with , etc. 

Remark 45 Note that the existence of K2 follows exactly in the same way as the argument leading 
up to {3.507^ (pfoof of existence of k) in Subsubsection \ 3.2.9l The only difference here is that 
it involves the growth of the number of green sites arising from either self collisions or between 
collisions of two different clouds. 



Proof of Proposition [3724] (a) In order to justify (|3.739p wc now reconsider the development of 
the dual particle system on {1, . . . ,N} starting from one factor Imj at location 1. We have used 
above the fact that for the range < t < where limjv_»po Q iogAf < li the particle system is 
described asymptotically as — > 00 in first order by a CMJ process with Malthusian parameter 
a. Correction terms are analysed using the multicolour particles system. 

Moreover if we follow two families starting with one factor Ijij at locations 1,2 respectively, 
then the probability that they collide, that is occupy the same site in {1,...,A^}, in this time 
regime goes to as — > 00. This is essentially equivalent to the statement that the probability 
that a green particle is produced in the WRGB system in this time regime goes to zero as TV — >■ 00. 
However for finite TV these probabilities are not zero and we now determine them by expansion in 
terms of powers of TV"*^. 

Recall that the system of white and black particles is a CMJ process as described above in 
which the number of occupied sites has the form 

(3.747) W{t)e"^ 

where limt_i.oo W{t) = W and this black and white system serves as an upper bound for the 
system of occupied sites in the dual process. Hence if at time t a migrant is produced by this 
collection following the dual dynamics, it moves to a randomly chosen point in {1, . . . ,TV}. Then 
the probability that it hits an occupied site is therefore at most: 



(3.748, ^ 
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2at 



Therefore a upper bound for the process of colhsions is given by an inhomogeneous Poisson process 
^(s) with rate 

(3.749) ^ 

and number of sites that have been hit this way in the time interval [0,t] is at most 

RecaU that W* = sup W{t) < oo a.s. For bounding second moments we can also show that 
t>o 

supoQ £'[(VF(t))^] < oo. In particular the number of sites at which collisions take place by time 
tN is o{N), i.e. has spatial intensity zero as — > oo. 

Moreover the expected number of sites to be hit more than once in the time interval [0, t] goes 
to as — >■ oo at the order A^~^. This order holds also for red- red collision by migration in the 
multicolour particle representation. 

Next we recall that when a white particle hits an occupied site it produces a red particle at 
this occupied site. Subsequently if the red particle is removed by coalescence with a white particle, 
then a green particle is produced. Due to the coalescence process eventually no red particles will 
remain at this site. Also due to migration eventually this site will revert to a pure white site. 
During the "lifetime" of such a multicolour site there is a positive probability that one or more 
green particles will migrate and then produce a growing cloud. (Recall that the lifetime has finite 
exponential moment.) 

Once such a new site containing white and red particles is formed, a green particle is created 
by coalescence unless either the white or the red disappears before coalescence. The number of 
green particles to migrate from such a special site is random with probability law that depends 
on the number C(s) of the occupying white particles at the time s of arrival of the red particle 
(asymptotically as — > oo no further red particle will arrive at this site before the coalescence or 
migration step of the red particle or disappearance of all whites at the site). 

In other words there is a positive probability that the green particle will produce green migrants 
before it coalesces with a red particle (such that only one red particle remains and the green particle 
disappears (recall Remark [30])). We denote the probability p^(s) that at least one green migrant 
is produced during the lifetime of a special site due to a collision at time s satisfies 

(3.751) (s) — > p{s) as N ^ oo, p{s) — > p > as s ^ oo. 

After the migration step the resulting green family, if non-empty, grows according to the CMJ 
process. Combining the two facts we see that the number of green sites this family occupies at 
time t, denoted g{t, s), satisfies: 

(3.752) g{t,s) =WS{t,s)e"^'-'\ 

and limt^oo W^{t, s) = exists and converges to as s — !• oo. Note that the event {W^ — 0} 
occurs with the probability that a collision does not create a green migrant (see Remark H51 below l 
and otherwise 

(3.753) W^{t, s) :— is the random growth constant for all the sites occupied by 

green migrants from a collision at a site at time s in the interval [s,t\. 

Also W^{t,r) is independent of W{-) and W^{t,r),W^{t,s) are independent for r ^ s. The 
analogous quantity W^^^(t,r) is defined to be the number of green particles produced in [r, i] by a 
collision at time r. 
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Define the analog of W^{t, s) in tlie case wliere tlie wliite sites are in tlie stable age distribution 

as 

(3.754) W<^'*(t,s). 

If we assume that the white population has reached the stable size distribution (for the number 
of white particles at an occupied site) at time s, then the probability law of the corresponding 
process W^'*{t, s) depends only on t — s. 

Remark 46 We note that at times t > sn with sjv — >■ c» the distribution of in the CM J process 
population approaches the stable age distribution as N ^ oo uniformly in t > sn- 

Then combining p.750p and (|3.752p we get that 

(3.755) E[g{W{-),N,t)]< f E[{W {sff]^ E[W<^{t, s)]e'''^'-'Us < const 







Provided the first inequality could be modified such that it becomes up to terms of order o{N ^) 
an equality, the following limit exists 

/>oo 

(3.756) \im Ne~^°'*''E[Q{W{-),N,tN)]^ K2E[W^], K2 = / e-°'"E[WS'* {u)]du. 

We note that the probability of collision in [0, tN] tends to as iV — oo therefore we determine 
the probability of such a collision, the distribution of the collision time given that a collision occurs 
and the resulting number of green sites produced by this one collision. Given that a collision occurs 
the site at which it occurs is obtained by choosing a randomly occupied site. 

In order to verify that such a modification exists and that the assumption on the error in the 
first inequality in p.755|) we have to (1) estimate how much we overestimate the collision rate 
by using the CMJ process (of black and white particles) instead of the real dual (i.e. {W, R, P) 
particles) and (2) in the application of we need to control the error introduced by assuming that 
collision events involve sites with the stable age distribution for the white particles and replacing 
jtN g-a(t«-s)^[p^s((ijY^ as iV ^ oo by the second part of the equation p.756|) . 

(1) We again note that the over estimate of the number of green particles corresponds to events 
having two or more collisions and this occurs with rate and this correction as well as the 
exclusion of the purple particles produces a higher order error term. 

(2) Consider the early collisions before reaching the stable age distribution. Noting that 



(3.757) / e2""e"(*"-^)rfs = Co7is<-e"(*"+"") 
Jo 



therefore these contribute only a term increasing as e"*"+''™ whereas the latter ones grow at a rate 
e^"*". If Sjv ^ oo, Sat = o(ijv), then the early collisions form a negligible contribution. But then 
as A'^ — > oo, the white population approaches the stable age distribution by time sjy. 

Note that for the late ones we cannot assume that W^{t, s) takes on its limiting value and for 
this reason we obtain the integral in the definition of K2- This proves Part (a) of Proposition l3.24l 

(b) We now consider (|3.743p when we begin with one factor at site ji at time and one 
factor at j2 7^ ji at time s > and the total number of occupied sites at time t is what we 
called nf'^^'^\ This can be handled exactly as in (a). However we want to keep track of the subset 
of sites occupied by migrants coming from collisions between the two different families. For this 
reason we constructed the multicolour system. 

Finally we have to show that the bound p.788|) gives actually asymptotically the correct answer. 
Following the argument in part (a) we note that the upper bound on the 6*1^2 particles gives us 
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a lower bound on the white particles and this in turn gives us by (j3.786p with a bootstrapping a 
lower bound on the number of 6*1^2 particles. It then follows that the overestimate of the number 
of Gi^2 particles is no larger than const • ^^p— • 

The analogous arguments also yields the claim (|3.745|) for g(i_2),tot- 

(c) Other than including the time dependence for Wi{-) the proof follows the same lines as 
above. In particular the first two terms in (|3.742p now correspond to the CMJ process obtained 
when N ~ 00 and the limiting expected value of the correction term now has the form 

(3.758) lim N ■e-"'-^'+'^E[g^,^2)MWii-),W2i-),N,t,t + .s)]=2K2it)E[Wiit)]E[W2it)]. 

This completes the proof of Proposition 13.241 ■ 
Part 3 (Two dual clouds interacting) 

We want to compare the dual populations (note the different brackets and the ~ indicating the 
populations instead of just the population sizes) 

(3.759) n^' } , { J2 } , n^' J2 } 

corresponding to the descendants of the particles at ji , j2 , respectively evolving independently and 
the resulting population starting with both initial particles and evolving jointly. In order to carry 
out this comparison we construct a coupling, i.e. construct all three populations on one probability 
space in such a way that the respective laws are preserved. At the same time we want to be able to 
compare these populations with the respective collision-free versions evolving as CMJ-processes. 
Construction of enriched multicolour system. 

To construct this coupling we use again the WRGB-system and we construct the particle system 
starting with 1 particle at both sites ji and j2, recall here the constructions from subsubsection 
11.191 We now modify this coloured WRGB-particle system by marking the offspring of the initial 
particles at ji, ^'2 by using an enriched colour system, i.e. using the colours 

(3.760) {WuRi,Gi,BuPi,W2,R2,G2,B2,P2}. 
together with 

(3.761) {w^,w;,r*,,rigig*2,bib;,p*,p;,r,,2,g,,2,b,,2,Pi.,2}- 

The second set of colours is reserved for particles involved in a collision between the two different 
families or their descendants. 
The new features are 

• When a Wi particle hits a W2 particle the result is a and i?i^2 pair as well as the original 
W2 particle at this site. A particle does not coalesce with the W2 particle. 

• If the Ri^2 particle coalesces with the W2 particle, the result is the pair consisting of a W2 
particle and a Gi,2 particle. 

• The above holds with 1 and 2 interchanged. 

• If W* hits a Wi particle it changes to a Rl particle. 

With this expanded colour set, we now have to define a suitable multicolour dynamic. We define 
this system more precisely after outlining the goals of this construction. This is done so that the 
{Wi,W* , Ri,Gi), i = 1,2 describe the families in the absence of interactions between them. The 
particles (i?i,2) Gi_2) correspond to the particle families resulting from the interaction between the 
two families and {Wi,W2, i?2, Gi, G2, -Ri,2, ^1^2) describe the interacting system. As before we 
want to couple the interacting system with a CMJ process. 
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Recall that the covariance of the type-1 mass in two locations if we write it in the form — 
consists in the dual representation of a cloud of dual particles evolving jointly from two 
initial particles at different locations and two independently evolving clouds starting from one 
particle each. Hence the idea is that the covariance of x^{i,t + s) and x^{i,t) is determined by 
the particles in 

(3.762) {n^-{i> u n^'^2}}\^nAr,{i,2}^ 

where n^^ti), Il^^t^} denotes the two independently evolving dual particle systems and n^^'^^ the 
jointly evolving ones. The set n^'{i} IJ n^^t2}^]^Af,{i,2} corresponds to the G1.2 green particles. 

We list now the precise evolution rules of the new coloured particle system. In the evolution 
of the populations arising from the two initial particles as before particles migrate, coalesce and 
give birth to new particles at the rates as in the dual. One modification is required, namely, in 
this case we allow singletons of one family to migrate to sites occupied by the other family and 
also singletons of one family in the presence of one or more particles of the other type can migrate. 
Here are the precise rules. 

(0) The rules for the particles and colours appearing in p.760p are exactly as before as long as 
the two populations (i.e. particles with a colour with one index either 1 or 2) do not interact 
or occupy the same site. If particles of the two families occupy the same site singletons of 
one family can now migrate. 

Furthermore we have the four additional rules concerning the colours newly introduced in 
(1X7611) : 

(1) When a white particle from family 1 and a white particle from family 2 coalesce the result 
is one W*, one W2 particle and a R12 particle. The particles evolve according to the 
rules of the dual particle system, but ignoring further collisions with the respective other 
population and upon collision with the own population a i?|-particle is created replacing the 
W^*-particle. Similarly proceed with the other ^-colours. 

(2) i?i.2 particles that coalesce with Wi (respectively W2) produces a Wi (respectively W2) and 
Gi,2 pair. 

Two R12 particles that hit an occupied site produce blue-purple pairs (i3i,2, -Pi,2) where 
as before the B12 particle is placed on the first empty site in the first copy of N continuing 
according to the collision- free dynamic and the Pi ^2 particle behaves as a typical dual particle 
for n^.{i^2}_ 

(3) Coalescence of G1.2 and i?i.2 particles produce i?i.2 particles (cf. Remark [50)) . 

(4) If a R12 or G1.2 particle more generally every colour in (j3.76ip gives birth the new particle 
carries the same type. 

These set of rules uniquely define a pure jump Markov process on the state space describing the 
individuals with their colour and location and with the coupling information (recall p.394p for 
how to formalize this). 

The key properties of the new coloured dynamics are the following. On the same probability 
space we have the various systems we consider. Namely 

(3.763) n^'-f''^ = union of the Wi, W* , R^, P^, R* , P* , for i = 1, 2 

The new system also includes our dual starting with two particles, namely, (by construction) 

(3.764) n^'-fi-^} = the union of the Wi,W2, Ri, R2, A, ^2, i?i,2, Gi,2 particles. 
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Furthermore we can again identify a CMJ-process, 

(3.765) union of (M^i, W2, ^1, ^2, Gi, G2, 01,2, Si, i?i,2, Si_2)-particlesS CMJ process. 

Remark 47 To compute higher moments a complete bookkeeping of the difference would require 
more and more different colours whenever such a collision with subsequent coalescence occurs. 
However although these collisions do occur, we note that they do not contribute to the calculation 
up to terms of order N"'^ . Therefore we again ignore these higher order terms instead of introducing 
further colours. 

In the sequel we shall retain the and W2 particles but not the R'^jBg, particles since 
the order in N of these sets of particles are of too small to play a role for our purposes. 

Lemma 3.25 (Negligible colours) 

Let Ml denote the number of R^^Bl or Pg particles in the system £. Then 

3at 

(3.766) £;[A/;^] = 0(_). □ 

Proof of Lemma 13.251 This follows since these correspond to families resulting from two or more 

3 at 

collisions and these have order 0(^^). q.e.d. 

If we count only the occupation numbers by the various colours the state space is given by 

McK{Wi,W,*,Ri,Gi,Bi}U 

(3.767) 

{W2, Wi,R2,G2,B2} U {i?i,2, Gi,2, Bi,2, P1.2}) X ({{1, 2, . . . , iV} + N + N})] 

(where TWc denotes the set of counting measures). 

This concludes the construction of a coupling via a new enriched enriched multicolour particle 
system. 

Consequences of the coupling 

Using the arguments of part (2) above we can conclude that: 

nf '^^^ + nf ^ i((M/' + W")){a + 7)e«* - [stotiiW {■)), iV, t) + g,,,{{W"{-)),N, t)] 

(3.768) 

+£i,tot{N,t) + £2,tot{^^t)' 

(3.769) nf '^^''^ ^ -{{W + W")){a + 7)e"* - fltot ((W^'(-) + W"{-)), N, t) + f i,2;tot(iV, t), 

c 

where 

(3.770) £'^,tot(A^,t) = o(iV-ie2"*") as ^ 00, ^ = 1, 2 or (1, 2), 

(3.771) 9totiiW'{-) + W"{-)),N,t)= r gtot{tN,s)^^'''\ds), 

Jo 

with a time inhomogencous Poisson process on [0, 00) with intensity measure 

(3772) m'i-){s) + W"i-){s))re''^^ ^^ 

and gtotit,s) is the size of the green cloud at time t produced at time s by a newly created red 
particle at a white site. 
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We know furthermore that 



,2at 




(3.773) E[Qtot{{W' + W"){-), N, t)] = k2E[{W' + W" f] 



N 



Then combining all these properties p.735|) follows by comparing the expression for Qtot{{W' + 



has an upper bound which is generated by the Poisson process on [0, oo) with intensity measure 



This corresponds to the difference term (i.e. lost particles due to collision and coalescence between 
the two families) given by the G'i.2-particles. It is bounded above by allowing the rate of produc- 
tion of collisions leading to the production of G1.2 particles by the CMJ-process which excludes 
collisions. We can then obtain a lower bound by reducing the white families by the respective 
upper bounds for the sets of green particles as well as the blue particles. This replaces the white 
family VF(i)e"* by VF(i)e"* — 0{^j^). Using this we can show that the result is an error term 

which is of order 0(^^). 

We now want a more detailed description in which we keep track explicitly of the collisions of 
the two families and the resulting G12 families. We therefore consider 

(3.776) KI^'^^'^\ "il^^iWi , iW2 , *-Rl , *-R2 , *-Rl,2 , *Gi , *G2 , *Gi,2 , *-Bi , ^-82 , *-Bi,2 , *-Bi,2)5 

which denotes the number of occupied sites at time t, respectively the number of sites having 
iwi white particles from the first population, in/^ white particles from the second population, 
etc. Similarly we can define systems for subsets of colours as we did before. We also consider 
the pair (u^, U^) describing the new coloured particle system and a new corresponding limiting 
system {u,U) for — > 00. For the (Wi, VF2, -Ri, Gi, G2, i?i,2, Gi_2, ^i, -B2)-system we modify 
the equation for Uwrg b , namely, p.459p to include the contribution to the dynamics of collisions 
between Wi and W2 particles. Similar we can proceed with other colour combinations. 

Then we obtain the following representation formula. Here we use the convention to write 

(3.777) [iw,,iw;,iR,,iPj),U^ [iwi,iw2,iRi,2)^ • ' ' 

for Uj^ {■) all other colours not appearing in the argument summed out. Then: 




(3.774) ^f'l + ^f'2-^f^^'2^ t>o 



(3.775) 





N,{1.2} 



U'^it) J2 (ilVl + «Vy2 + ifll + *fl2 + *fll.2 + *Pl + *P2 + «Pl,2) 



(3.779) 



Ui^{iWi , iW2 1 : *fl2 : *fll.2 ' iPl 1 *P2 : *Pl.2 )i 



where the sum is over iwi , iw2 j *Pi j *-R2 i *Pi,2 i *Pi i *P2 j *Pi,2 • 
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Wc now focus on the are sites at which a colHsion occurs between a Wi particle and a W2 site 
(or vice versa) which can then produce migrating Ri,2 or Gi.2-particles. Therefore we define as 
"special" sites at time t, those sites at which 

(3.780) one or more Ri^2 particles are present at time t and also 

a Wi or W2 -particle is present at time t. 

Note that special sites have a finite lifetime and can produce migrating i?i,2j Gi^2 particles during 
their lifetime. Denote the number of special sites by: 

(3.781) K^^*=i iUn,,2>l■i^^^-,^o^U^-,M''i^w,,^W2,^R^,2)\ -u^it). 

\ ,*W2 1*^1 2 / 

In the calculation of second moments we can ignore errors of size o(N~'^), we can work with 
the system of Wi,W2, Ri, R2, Ri,2 and ignore the purple particles i.e. the colours Pi,P2,Pi,2- 
Therefore it suffices to determine the number of sites occupied by Wi,W2, Ri, R2, Ri,2 particles 
which estimates errors of order 0(^^). This results in the expansion given 

in p.742[) where in p.744p the three gtot terms correspond to the sites occupied by only Gi, G2 
and Gi^2 particles. It remain to estimate these terms. The Gi, G2 terms are similar to those 
considered earlier so we focus on the Gi^2 sites and particles. We will see below that key to the 
determining the asymptotics of the covariance at two sites as iV — )■ (X) is to analyse the number of 
Gi.2-particlcs asymptotically as N 00. We have now all the tools to start the estimation. 

Estimates on the number of Gi 2-particles 

In order to study the asymptotics of the number of G 1^2 particles at time ^at, in sublogarithmic 
time scales which we denote this number by: 

(3.782) n^^^^'^ with tN as in (ir736| . 

We next recall that independent systems of dual particle systems with starting particles in site ji 
at time and site j2, at time s > with ji 7^ j2, produce clouds as iV — > cx3 with independent 
stable size distributions and the number of sites of the two dual populations is denoted by 

(3.783) Kf'^'\K^-^'\ 

We then have by the CMJ-theory and the Kt for approximation — > 00, 

(3.784) K^^^^^ - We"'" , K^^^^^ - iy"e"*« . 

To identify the correction terms needed for second moment calculations we need to consider 
the dual particle system starting with two particles one at site 1, one at site 2 and estimate the 
effects of the collisions between the two clouds. Using our embedding in the coloured particle 
system this can involve a collision in which a Wi particle can migrate to a site already occupied 
by a 1^2 particle (or vice versa). Once a collision occurs, this produces a W*,Ri^2 pair. The 
R1.2 particle can move to another site before it coalesces with one of the W2 particles of the other 
colour. If it coalesces the W2 particle remains an a new Gi_2-particle is added at this site. The 
G1.2 particles can then reproduce and coalesce with other particles. This way they can produce 
more Gi^2 particles or migrate to produce a new Gi^2 site. The system of Gi^2 particles after the 
creation of a Gi^2-particle has the property that after foundation a Gi_2-family does not anymore 
depend on the other colours, therefore the Gi_2-system evolves as a copy of the basic one type CMJ 
particle system with immigration given by a randomly fluctuating source which is independent of 
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the CMJ-proccss. The creation of new Ri,2 partieles at time s is determined by the number of 
{Wi, W2)-pairs at time s. 

Suppose we are given the evolution of the Wi, M^2-particles. Then the events of colhsions 
between Wi and W2 particles is given by a Poisson process and when a collision occurs at a site 
there is a positive probability of a coalescence and therefore the production of a green particle. 
This then produces a growing population of Gi^2 descendants according to the CMJ-theory leading 
to the analogue expression to p.755|) which we had for Wi, W2-particles. 

If we replace the system of white particles of the two types by the independent site-1 and site-2 
system, we obtain an upper bound for the mean number of Gi^2 particles given the growth constants 
Wi and W2 of the two clouds, so that at time t = t]y provided that 

(3.785) tN^o{^^), 

a 

by just calculating the mean. 

This results in the following asymptotic population size for the Gi^2 particles which is then 
again given by the behaviour of the conditional mean (given the Wi , W2 populations) : 

(3.786) ^ / E[W]e"^'«-^Hn^'^'^ K^^^^^^ + Il^'^^h<^^^'^)p^ {u)du + oiN-'). 
^ Jo 

The real system requires a correction term due to the interaction of the two clouds leading to a 
reduction of the two independent white systems by the Gi^2 but which is of lower order in N than 
the Wi, H^2-particle numbers and their effect, i.e. o{N). Hence from (|3.786p we get the asymptotic 
upper bound for the mean number of Gi,2 particles as — > 00 if we condition on the growth 
constants W and W" of the two clouds. As bound we get an expression which is asymptotically 
{N — > 00) equal to: 

(3.787) ^W'W'e^"*", 

and which differs from the real system by an error term of order o( ). 
In fact we can obtain 

(3.788) limNe-^°'*"E[gn2){W{-),N,tN)]=K2E[W']-E[W"], K2 = e-°'''E[W<^'*{u)]du. 

N~^oo Jq 

exactly the same way as for p.756p . 

Remark 48 In the longer time regime ^"^"'^ + 1 the growing cloud of green particles G12 satisfies 
a law of large number effect similar as in \3.485^ , that is, conditioned on Wi, W2, the variance of 
the normalized number of Gi^2-pO'i~t'icles goes to zero as N ^ 00. 



Part 4 (Random exponentials) 

Wc next present the facts needed about the expansion of random exponentials in the following 
lemma. 

Lemma 3.26 (Asymptotics of random exponentials) 

(a) Let X he a nonnegative random variable satisfying E[X'^] < 00 for all fc G N. Let f(e,X) 
be such that 

(3.789) f{e,X)>0, f{s,X)r^e[X-SKX^ + 0{s^)X^], 



3 PROOF OF RESULTS 



155 



that is, 

(3.790) sup < CX3. 

o<e<i 

Consider 

(3.791) £;[l-e--^(^^^)]. 

Then the following two relations hold: 

(3.792) lim i = E[X], 

£->o e 

(3.793) lim = (1 + 2k)E[X% 

(h) Consider now random variables X,Y with E[Y''' + X'^] < oo for all k £ N. Let be a 
superposition of a Poisson number of independent copies of a random variable Y with intensity 
eX^. Assume now that f{e,X,Y) > and that we have: 

(3.794) f{e,X,Y)^e[X -Y,]. 

Then {3.792^ and IS.793^ are satisfied with 

(3.795) /(£, X) replaced by /(e, X, Y) and l + 2n by 1 + EY. □ 

Proof (a) Since the limits p.792p and p.793p are not affected by the 0{e^) term in (|3.789p we 
can assume that it is a constant without loss of generality. Then by the mean value theorem, 



(3.796) = D{X,e*) where D{X,e) = _ 5-"^+^ ''^ ] 

£ de 

and where < e* < e and lime_^o D{X, e) = X. Therefore we have for c > that 

1 - e- 



(3.797) E 
Then 



-eX+e'^KX'^-e^X^ 



E[D{X,e*)l^x<^)\ + ^^-P[X>^]. 



(3.798) D{X,e) = _ ^-ex+e-.x--e^x-^ ^ [(X~2e2«:x2 + 3£2x3)e-^^+^''=^'-^'^']. 

de 

Note that 

(3.799) E[ sup \D{X, e)\] < coiistE[X + X^ + X^] < 00. 

0<e<l,0<X<f 

Therefore by dominated convergence for every c > 0: 

(3.800) lim E[l,x<^) ■ D{X, e)] = E[X]. 

e— >0 ~ ' 

Since £'[X''] < 00 for fc e N we have 



(3.801) P\X > -] < E[X^] ■ (-)* 
e c 

Combining p.800p . p.BOip we conclude from p.797p the claim p.792p . 
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Similarly for the second order expansion write , 



(3.802) 



14 - e 



1 



-X] = -[D{X,e*)- D{X,0)] =D\X,e**), 



where D^{X,e) ^ ^[1 -e-^^+^ ''^ ^' ] and 
(3.803) lini[i:i2(X,e)] ^X"^ + 2kX'^ = {1 + 2k.)X'^. 



E^O 



As above we then conclude the claim (|3.793p . 
(b) We have 



(3.804) 1 - e-^(^'^''*') = 1 
and 



e ly^o - e ly=o 



(3.805) 



sup 



. 1 di 



— (1 - e^-'^^'^'^^'*'))! < const • {X + Y). 



0<£<1, Y<{ 

Therefore the relation is now obtained as follows: 

E[l- e-f^'-^^'^'^]' eE[X] 



(3.806) 



lim 

E->0 



lim 

E^O 



= lim 

£->-0 



Er[{{l ~ e-^^]) + eE[{l - e 



]E[Yl 



0<Y<i 



■sE[X] 



^e^E[X^] + e^E[X^]E[Y] + P[Y > i] 



E[X^]{1 + E[Y]). 



Part 5: (Proof of Provosition \3.2l\) 

To prove (a) (|3.710p and (|3.71ip it suffices to show that 



(3.807) < lim E[e- 

X— ^oo 



5:2 (*x)] < oo, 0< lim Var[e 

X— s-oo 



X2 (tx)] < OO, 



(3.808) limsup£;[(e-"*"x^(iAr))'^] < oo, 

X->oo 



so that the family {£(x^ (^x)) : G N} is tight and their limit points are non-degenerate and 
random since they have finite non-zero mean and variance. If we can identify the limits in p.807p . 
then we can obtain p.712p and p.713p via ([37 



The proof proceeds in steps, using duality representation of general moments in order to obtain 
first, second and then third moments of a;^ (tx)- It follows from Lemma r3.22l that we must consider 
correction terms of order in the calculation of kth moments of {t) in order to later derive 
the asymptotics of the moments of a;^ using the representation of formula p.716p . 

Recall that the basic dual relation 



(3.809) 



E[ix'^i^,t)r]=E 



1 — exp — 



N 



where 11^'° denotes the total number of dual particles if we start with a g N particles at site i S 
{1, . . . , N}. We have obtained a representation of this set of particles in terms of a decomposition 
into white, red and purple particles embedded in a multi-colour particle system of white, red. 
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purple, blue and green particles (WRGPB-system) . We have also coupled this system to a related 
CMJ process which in particular provides an upper bound, which was the WRGB-systcm. We 
now look at this system in more detail in the time interval [0, tAr] where satisfies the hypothesis 
p.709p . We first recall that the number of occupied sites in the CMJ has the form 



(3.810) n 



CMJ. a 



lim = Wa, a.s. 



Moreover the set of white and red particles in given by removing a set of green and blue particles 
from the CMJ process and we can bound the number of purple particles by the set of blue particles. 
Therefore we can represent the dual population by removing the green and blue particles with an 
error given by the difference between the blue and purple particles. Note that the blue and purple 
particles result from the event of 2 or more collisions and therefore the error term can be obtained 
by determining this probability. Moreover we have seen that the collision process is given by a 



tsWaXs}e — ^ Therefore the probability of a 
^ -) and this goes to as — >■ cxd as — >■ oo under hypothesis p. 7091 



Poisson process with intensity bounded by a{s)Wa{s)e' 
collision in [0,tAr] is 0{ 



). Let # denote the number of collisions in [0,tAr]. Moreover the probability of the event two or 
more collisions is 



(3.811) F(# >2)<Zi 



7V2 



where Zi is a random variable having finite moments. Conditioned on the presence of a collision at 
time s S [0, tff] the resulting set of green particles is given by W^^^{tN, s)e"(*"~*) where W^^^_{tN, s) 
is a nonnegative random variable with all moments and bounded by an independent copy of the 
CMJ process. 

Then using Taylor's formula with remainder we obtain 



(3.812) 



E 



1 — exp 



E 



m 

'n 



/n^-(.)d. - ^ /n^-( 



{s)ds 



2N^ 



J n^''^(s)ds 



mm 



(3.813) 



£2{N) < 



m 



m „ 
< —-.Zoe 



3o(t]s 



67V3 



where Z2 has finite moments. Calculate first the expectation of the occupation integral. 



(3.814) 



E 



m 
N 



tN 

J n^-'^(s)ds 



.0 



N 



tE 



(q(s) +7(5)) f.as, 1 /■ (a(s) +l{s)) 2„r f. N ,„ 

Wa[sje ds-— / [Wa{s)) Wg^tot[tN,Sjds 





+E[£iiN)] 
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Here the second term eomes from integrating over the event = 1} and corresponds to a green 
family produced by a single collision. To estimate the first term on the r.h.s. of (|3.812p we calculate: 



(3.815) 



E 




tE 



(a(s)+7(s)) 



Wais)e"'ds 



■EiSaiN)) 



iV3 



We are now ready to obtain study the first and second moments of iE2(ijv)- Consider a sequence 
sn ^ oo < tjy — sn ^ oo. Then using the basic properties of the CM J process p.l43p and 
p.l66p we have Wa{s) Wa,a{s) — > a and ■y{s) — > 7 (a.s. and in L^) as s — > 00. Using this we 
obtain 



(3.816) lim Ne-^^^E 



lim Ne-^^^E 

N^oo 



a{s) + -f{s) 



= lim TVe""*" I E 



a{s) + -f{s) 



Lo 



E 



a{s) + 7(s) 



Wais)e°''ds 



(a + 7) 



Wa. 



Therefore 

(3.817) ^lim^iVe-"*" [ 1 - i?[exp(-^ J U^'^'ds)] 



= lim Ne-°'*"E 

N^oo 

(3.818) m*E[Wa] 
where m* = iiil. 

etc 

In the same way we obtain 



(3.819) lim N^e' 

N—^oo 



1 - E 



^2l , (rn*)' 



N 



cxp I / Uf'^'ds I - ^ / W^^'^ds 



N 



jN., 



Step 1 First moment ofx2{tN) 
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Note that by exchangeability 
(3.820) E[x^{t)]= NE[x^{t)]. 



Then 
(3.821) 



E[x'^{t)]^N-NE[x'^{t)] 



Using the above calculations we have 
(3.822) E[x^{tN)] = NE[x^(tN)] = NE 



m 



Therefore using p.817p . we obtain 
(3.823) lim i;[e-"*"xf (tjv)] = m*E[Wa]. 
Step 2 Second moment ofx2{tN) 

We will focus next on the calculation of the second moment. We follow the method used above 
but leave out some details. 

To compute the second moment of {t) note that 

E[{x^{t)]^) = 



(3.824) 
and with i 7^ j 



2NE[x^{t)]+E[{x^it)y] 
2N^E[x^{i,t)]+E[{x^{t)f] 



(3.825) E[ix^ it))'] = NE[{x^ (*, t))'] + iV(iV - l)E[x^ t)x^ (j, t)]. 

Since the formula contains 0{N') factors, in order to compute the r.h.s. up to order 1, as — > 
00, it is necessary to include corrections of order in both E[x^{i,t)] and Elx'^^ {i,t)x^ {j,t)], 
etc. The first moment expansion we have done in Step 1. This means that we have to consider 
corrections due to possible intersections of different clouds (clouds meaning the descendants of one 
of the initial individuals.) 

Consider two clouds starting with one factor at different sites in the dual population which 
occupy the following number of sites respectively 

(3.826) VFi.Are"*" and M^2,Are"*", as ^ 00, 

with tpi as in (|3.709p and where Wi^N = Wi{tN) — > Wi as iV — )■ 00 for i = 1, 2. Then the rate at 
which the first cloud produces particles that migrate onto one of the sites occupied by the second 
cloud and symmetrically is as A'^ — > 00: 



(3.827) 



2(a + 7)^ W^i.AfW^2,Are^ 
c TV 



2(a + 7)^ W^iVFze^ 
c N~ 



If the two clouds have an intersection, coalescence can reduce the number of particles compared to 
{Wi + VF2)(a + 7)e"*" and this produces a correction term corresponding to the 6*1.2 particles. We 
have analysed this correction term in all detail (in the proof of Proposition 13. 24|) using a coloured 
particle system but here using that construction and its consequences, recall p. 739113. 74ip . we get 
a result on the behaviour of the number of dual particles and can perform moment calculations. 
We use these results to calculate covariances for i 7^ j, namely, we use the dual and expand 
the time integral using (|3.743p with s = to represent Ilu'^^'^^ for u G [0,tAr]. We estimate 







t{W^^'{-),N,u)du] by ^E[{W^ 



and p.74ip to bound the error term. 
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If ijv satisfies the above growth assumption then as A'^ — >■ cxj wc can expand the exponential 
and again verify that we can take expectation in this expansion, as in (j3.823|) . (Recall Lemma 13.261 
for the justification of these expansions.) Then using again (|3.809p we calculate as follows. 

= E re"'Sr[(W'l+W2)(a+7)(e°*"-l)-/o" BtataWi+W2)i-).N,u)-e''*Ndu+o{N-^)] 

(3-828) 

= 1 - ^E[{W, + 1^2)](« + 7)(e"*" - 1) + ^^EiiW^'^)^]^ 



+ %^{E[Wi+W2]re'-'^ +0 

We have as iV ^ cx) for i j: 
(3.829) E[x^{i,tN)x^U,tN)] 

~ 1 - —E[Wi + W^2]e"*" 



where W'^'^ is the growth constant if we start with two particles at one site. Similarly we get 



(3.830) E[{x^{i,tN)) 



m 



- 1 - —E\W^'']e 
Furthermore note that 

N N 

(3.831) Var (^^(^, 1^)) = ^ Var(a;f (^, tn)) + Cov(xf (^, ^at), (j, 1^)). 

4 = 1 i^j 

Then by (|3.825p combined with p.809p and p.828p the total mass of type 2 satisfies for iV ^ oo 
by expanding the exponential (below i ^ j): 

E[{x^{tN)f] = _ 2NE[x'^{tM)] + E[{x^{tN)f] 

= iV2 _ 2N^E[x^{i, t)] + iVii;[«(z, tN)f] + N{N - l)E[xf [i, tjvX(j, ^tv)] 

(3.832) -A^^ + 2A^m*£;[W^i]e"*" + iV(l - i^i;[W^(2>i)]e"*«) _ m'^E[{Wif]e^°'^'' 
+N{N ~ 1)(1 - l^2E[Wi]e"'« + (^IC^i+w^^f ^ 

+ i7V(iV - iy-i^E[{Wi + Ty2)^]e2"*". 
Therefore we get 

(3.833) ( r.h.s. ([5:55^ ) 

2E[WiW2\K2m*e^"^« - m* E[W^'^-^'^]e°'^'' 
+m*2(£;[M^i])2e2"*« + o(l), as TV ^ cx). 
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This implies 

(3.834) lim (e-2"*"£;[(J^(tjv))']) = {2K2m* + m*^){E[Wi])\ 

TV— S-OO 

(3.835) lim {e-^°'*"Var[{x^ (tN))]) = 2K2m*{E[Wi]f. 

N—^oo 

Wc now consider the case with two times In and +t. Following the same steps as above we 
can show that 

(3.836) lim {e-^"'''' E[{x^ {tN))x^ {tN + t)]) = (2K2m* + m*^)iE[Wi)]^e''' . 

N^oc 

Step 3 (Third moments) 

We can proceed again as before, expand the a;^ (i) in terms of [t) and apply the dual 
representation again. Here however we only need an upper bound and therefore we can bound the 
total number of dual particles from above by the ones of the collision-free process. Then the claim 
follows again easily by using p.l47p . 

This completes the proof of Proposition l3.21l 



3.3.6 Asymptotically deterministic droplet growth 

We now return to the analysis of the beginning of the evolution, where the droplet growth starts. 
Over time periods which are large but remain much smaller than as iV — >■ cxd the total 

mass of type 2 is expected to grow deterministically as an exponential with rate a and constant 
(random) factor. The random factor developed over a short time period at the very beginning of 
the evolution. This leads to the idea to focus here in a first step on the case where after some 
time to "we turn off the mutation. The point is that the mutation which enters later than some 
large time to is not relevant for the growth of the droplet anymore if we choose to sufficiently large 
which we show in the second step. 

In this section we therefore compute the moments of (t) conditioned on the configuration at a 
fixed time to, that is, given a specific realization of the random configuration (*, to), i = 1, . . . , A'^, 
then identify the asymptotics as first N oo and then t — >■ oo. In the Corollarv l3 . 281 and Corollary 
13.301 and then later also in the next subsubsection in Proposition 13.291 we will then consider the 
joint limit of ^ cxi, t — > oo, for the quantity e~"*a;^ (t), more precisely e~"*"x^ {In) for N ^ oo 
with tjv t oo and limsup(tjv/ logiV) < a. Those quantities converge in probability to a random 

limit. 

In order to mimic the situation, we consider a process, where the rare mutation is turned of 
after time to for times t > 0, we consider a process starting in finite mass and m ~ for all times. 

Proposition 3.27 (Deterministic regime of droplet growth) 

Assume that at some time to > the distributions of {x2 (to, i) : i = 1, . . . , N} for the system 
U.S\} . Jt.^p are converging to the one of {x2{to,i),i € N} with X]i^2(to,i) < oo a.s. 

Moreover we assume that for every N and for the limit configuration we have that V £ > there 
is a finite random set (we suppress the dependence on N in the notation): 

(3.837) lis) = lie, N) of k = fc(£, N) sites, 
such that for every N > 2: 

(3.838) ^2{io,i)<e. 
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Assume 

(3.839) m = fort>to. 

We set (recall forU{t, ■) and (MEW' P^?^ forU{oo, ■)): 

OO OO 

(3.840) g{t,x)^{l-Y,{l-xYU{t,i)), g{^,x) = {I -Y,{1 ~ xYU{^,l)). 



1=1 



Then the following four convergence properties hold: 
(a) 

(3.841) lim e-"*£;[:i™'*°([0,l])] = lim e~°'* \im E[x^ (t)] 



i=l 



(b) 



(3.842) lim lim {E[e-°''^'+''^x^ {t + s)] - E[e-°''x^ {t)]) =0, Vs G R, 

i— s-oo N^oo 



(c) 
(3.843) 



lim lim e-°'^''+'^*^Cov{x^{t + s),x^{t)) 

t— >oo Af— >oo 

= (i?[W^])2 2^2j:gi^,xito,t)) 



, Vs e M, 



anrf 



(d) 



(3.844) lim lim 



jg-a(i+.) + s)] - e-"*£;[J^(t)]) ' 



= 0, Vs>0. 



(e) Now assume that m > for all t > to (recall h3.839\) . Then the conclusions of (a)-(d) 
remain valid. □ 



We next verify that we can compute the simultaneous limits N ^ oo, t ^ oo for times 
<Ar(/3, t) = (/3/a) log + t, < /3 < 1. 

Corollary 3.28 (Droplets of size grow deterministically in p) 

Consider again the case where m = and the initial state is as in Proposition \3.2T\ Choose j3 
with < /3 < 1 and let 

(3.845) t,{p,t) = ^-^^^+t. 



a 



We have 



(3.846) lim lim E 



(e-"*^^(i) - e-"*"(^'")j^(tiv(/3,0)) 



0. □ 



This result shows that the randomness in the droplet growth is generated in the very beginning 
at times of order 0(1) as iV — >■ oo. In particular we can then show that, conditioned on the total 
mass of type 2 at time tAr(/3i,0), the total mass at a later time tAr(/32,0),l > /32 > /3i > is 
deterministic in the limit — > oo. 
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Remark 49 The limiting total droplet mass dynamic as iV —> oo becomes deterministic in the 
following sense. Assume that {a;^(i,to), i = l,...,iV} is a measure with total mass ana, 
supported on a set of size aN^ for some a G (0, oo) and /? G (0, 1). Then we can show that for 
t > to we have: 

.on.^N Var[x^it)] 



Remark 50 Given a fixed initial measure {x(to,i),i G N} as in the statement of Proposition \37^ 
we can also prove that as t ^ oo 



(3.848) lim E[Y^{x^{^,t)f\ = 0( lim i?[^(a;^(z, t))]). 



% i 

This is analogous to the Palm picture 113. 686]) and implies that the mass of type 2 is clumped on a 
set of sites on which the mass is almost one. 

Proof of Proposition [3T271 The proof proceeds by using the dual representation of moments 
and the multicolour particle system introduced in Subsubscction [3.3.51 to analyse the expressions. 

(a) We determine the contribution to the first moment coming only from the initial mass in 
T{e). The result then follows by letting e — > 0. 

Without loss of generality we use (recall k = fc(£, N),I{e) = I{e, N)) 

(3.849) 1(e) = {l,---,fc}. 

Note that since we are now not working with a spatially homogeneous initial measure, we must 
modify the dual. In particular we eliminate the rule that single particles at a site do not move. In 
the calculation of E[x2 (t)] we compute 

N 

(3.850) E[x^{t)]^Y.^^'^2U,t)]- 

i=i 

To compute E[x^{j,t)] we start the dual with one particle at site j G {1, . . . ,N}. However the 
number of particles at this site will return to 1 after at most a finite random time and therefore 
after some finite random time the single particle will jump to a randomly chosen site and site j 
will become empty. Therefore for large times t the value of E[x^ {j, t)] is independent of j (even if 
it is in X(e) = {1, . . . , fc}). 

We calculate the expectation of type 1 in a site j using the dual starting with one particle at 
site j. 



(3.851) E[x'{' (tj)] = E 



- 5?^ (io,*))^" 



We first consider the initial configuration where we have 
(3.852) (to, i) = for every i ^ X{e). 

Recall that here U^{t,£) is the frequency of occupied sites that have i particles. 
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Hence noting that there will be a Bin{K^ , -^) number of hits of the fc-set 1(e) by the dual 
cloud and the frequency of sites in the cloud with £-sites is given by U'^ (t,£) and the contribution 
of such a site by (1 — {tQ,i)Y, we get for sites j ^ {1, ■ ■ ■ ,k} that: 



(3.853) E[x'^itJ) 



E 



1 1 T^'N °° 



(1 — +(l — ) 

^ ^ 



i=l l=\ 



^ n' 2iV2 

A: oo 
il,i2 = l«i,«2 = l 

+oiN-^)- 

Note that as — )• oo, U^{t,£) U{t,£), with U{t, •) the size distribution of the CMJ process at 
time t. 

We rewrite the formula (|3.853p as follows. Let 

oo 

(3.854) g^{t,x) = l-J2i'^-^yU^{tJ) 

e=i 

and 

oo 

(3.855) g^^^t, a;) = 1 - E(l - xY {U"" {tyu^ {tW). 

i=i 

Note that ,g^'^ do depend as well on the initial state of the dual process via = U^''^^'^\ 
this dependence we suppress here in the notation. 
Then: 



(3.856) 



E 



EE.- a'^it, ^2 {to, *i))5^(i, x^ih, ^2)) 



■Y,g^■\t,x^{t,,^)) 



+ 0(iV-2). 



Note that because of the convergence of the dual particle system to the collision-free McKean- 
Vlasov dual we have: 

oo 

(3.857) g^{t,x) ^ 1 - ~ xYu{t,l) as iV oo, 

1=1 

oo 

(3.858) g^'^(i, x) — ^ 1 - E(l - xYU*^{t, I) as N oo. 
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Therefore we can conclude from equation p.856p that for ie{l,2,---,fc}: 



(3.859) E[x^{t,i)]^E 



Kl 

N 



4=1 



o(iV-i). 



Consequently we calculate: 



(3.860) E[x'^{t)]^E 



1=1 



0(1). 



Then by Kt as iV ^ oo, (|3.857|) and dominated convergence we get: 

A: 

(3.861) W-m E[x^ {t)] = E KtY g{t,X2{h,i)) 

L 4=1 

where we define g{t,x) using p.857p by 

oo 

(3.862) g(i,a;) = l-^(l-a;)^Z^(i,£), 

e=i 

together with the fact that Kt,U{t,-) can be identified as the number of occupied sites and the 
size distribution for the McKean-Vlasov limit dual process obtained above in Subsubsection [372.41 
(Step 2). 

Finally putting everything together we have: 



(3.863) lim(e-"* lim E[x^ (t)]) ^ E[W]y^ g{oo,x{to,i)). 



i=l 



Finally we have to remove the restriction on the initial state being outside I(e, N). We note 
that the system where a;^(0) is different from zero only at X(e) and the true system differ at time 
t at most by Const ■ ee"* and hence we get letting e (in 1(e)) finally the claim of part (a). 

(b) The claim on the first moment is immediate from (a). 

(c) We now consider the second moment calculation again using the dual particle system. As 
before in the proof of part (a) we consider first the system arising by setting the type 2 mass equal 
to zero outside the set I{e) and then later we shall let e — >■ to obtain our claim for the general 
case with the very same argument as in part (a). 

First note that 



(3.864) E[x^ {t + s)x^ (t)] = E[x^ {t + s)]E[x^ {t)] + cov{x^ (t + s),x^ {t)) 

N 

= E[x^{t + smx^it)] + J2 Cov{x^{t + s, z), (xf (i, i))) 

j=i 

JV 

+ Y.Cov{x^{t + s,j)),x'^{t,l)). 

Since the last term of (|3.864p has TV (TV — 1) summands it is necessary to consider terms of order 
0(-^) in the computation of Cov{x^{^ + s, j)), x^(<, £)). Therefore the calculation is organized 
to identify the terms up to those of order o{N^) as N ^ oo, and then to calculate the contributing 
terms ofO{N'^). 
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To handle the calculation of Cov{x^ {t + s, j), {t, £)) both for j £ and for j — i = i we use 
the dual process. We introduce as "initial condition" for the dual particle system: 

(3.865) one initial dual particle at ji at time and a second dual particle at j2 at time s 

and then consider the dual cloud resulting from these at time t + s. 

Since we shall fix times and s where the ji, respectively j2, cloud start evolving we abbreviate 
the number of sites occupied by the cloud at time u and similarly the process of the frequency 
distribution of sizes of sites by 

(3.866) K^^^'^^' = i^^.0'i'0).(j2'^) and f/^-^i-^'^ = [/JV.(ji.o).02,.)^ 

Precisely this runs as follows. There will be a Bin{K^'-'^'-''^ ^j^) number of hits of the A:-set 
1(e), hence as — > oo we obtain: 



(3.867) 



E[x^{t + s,j^)-x'^{t,j^)] 



K, 



E[ n (i-^f(io,^))'^"""] 



E 



K, 



N,jl,j2 

t+s 



N ^'-N^ 



k ,J^NJl: 



k oo 



^ n' 



N,jl ,j2 ( Ty-NJl .j 



-1) 



2iV2 



OO 



«1, 42-1^1,^2-1 

{l-x^{to,^lyf'{l~x^{to,^2)Y 



+ o{N-^). 



"EE^*+"'"W(l-5^^(^o,^))^ 



i=l £=1 



Therefore if we now use the function g^{t,x) defined in p.854p but now for the initial state 



specified by the space-time points (ji,0), {j2,s) (note g 



N 



9 



, respectively = g 



N,{h,0) 



), we 



can rewrite the above expression. However there is a small problem with the time index, since our 
two clouds now have a time delay s therefore depending on which of the two clouds has the hit of 
X(e) we have to use t or t + s a.s the argument in our function g^ {■, x). Here we use therefore the 
notation t(-) which indicates that actually the time parameter here is either < or t + s and depends 
on which of the two clouds is involved. However this abuse of notation is harmless since in the 
arguments below both the times t and t + s will go to oo. 

We get, with the above conventions, from the formula (|3.867p that: 



(3.868) 



X.g^(i(.),2?^(io,*)) 



1 - E 



t+s 



+E 



t+s 



N 



1=1 



t+s 



1) 



J2 5^(^(•),5?2~(^o,^l))ff^(^(•),^2^(^o,^2)) 

il#i2 = l 



+ o(iV-2). 
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Now wc continue and calculate the covariances of (t + s,ji) and x^{t,j2) based on p.868p 
and the first moment calculations. For this purpose we can consider the dual systems starting in 
(ji,0) respectively (j2,s) and evolving independently and the one starting with a factor at each 
time-space point (ji,0) and (j2,s) but evolving together and with interaction. As we saw we can 
couple all these evolutions via a multicolour particles dynamic. This now allows to calculate with 
the first moment formulas obtained in p.85ip to p.862p and using the notation 

(3.869) s,„ = if m = 1 and s,„ = s if m = 2, 
and we obtain the covariance formula 

(3.870) Cov(.Tf(t + s,jiX(<,j2)) 



1 - E 



-E 



N,jl J2 

t+s 



N 



^5^(t(.),5?^(0,z)) 



1=1 



t + S 



t+s 



-1) 



2iV2 



m— 1 



J2 <?^(^(•),^2^(^o,^l))5^(^(•),2?^(0,^2)) 

il#l2 = l 

K^'-'"' (t + Sjn) v-^ TV, —AT/. • \\ 



t + Sr. 



t+s„ 



1) 



k 



2iV2 



X! X! 9^{.t + Sm,x^^-^)g{t + s 



-o{N- 



We now have to distinguish two cases, namely ji ^ j2 and ji = j2 and we argue separately in the 
two cases. 

From (|3.870p . using p.859p and Proposition 13. 24i we get that if ji ^ jz, then 

C0v(.T^(t + S,Jl),X^(t,j2)) 



(3.871) 



The expression for ji = j2 is a slight modification of this, since either we have as first step a 
coalescence and are left with one particle or we get first a migration event and get a ji ^ j2 case, 
or finally we get a birth event first. Since we have the time delay s of course coalescence can act 
only with this delay. 

Therefore combining both cases up to terms of order o(-^) as — >■ oo. 



(3.872) 



CoYix^{t + .s),x^{t)) 



E[N{K^4^ + K^^^^ - K^4li^nYlg^{t,x^{toM \ + o{l) 



Recall that g'^ (t, X2 {to,ii)) g{t,X2{to,ii)) as TV 
identify the behaviour of 



00. Therefore the main point is to 



(3.873) N{K^i^ + - K^^'^), as N 



00, 



which is N times the difference between two independent dual populations each starting in ji and 
j2 and one combined interacting system. In order to evaluate this we must determine the effect of 
the collisions of the dual populations. 



3 PROOF OF RESULTS 



168 



In order to analyse the dual representation of the r.h.s. of equation p.864p rewritten in the form 
p.872[) we will again work with the enriehed coloured particle system which allows us to identify 
the contribution, on the one hand of the two evolving independent clouds, and on the other hand 
the effects of the interaction between the two clouds by collision and subsequent coalescence. 

The construction of the coloured particle system (observing Wi, W2, Ri, R2, -R1.2, Gi, G2, Gi_2 
particles which are relevant for the accuracy needed here) was given above in the sequel of p.759p . 
Recall that the Gi^2 particles are produced in the coloured particle system from collision of Wi and 
W2 particles, that is, Wi and W2 particles at the same site which then suffer coalescence. Such an 
event reduces the number of occupied sites in the interacting clouds by 1 and then creates a Gi^2 
family. Then the number of occupied sites in the Wi , W2 interacting system K^'-'^ '•'^ is given by 

First we observe that if we evaluate the duality relation (recall the form of the initial state we 
consider here) we see that the contributing terms of order 0{-^) arise from either double or single 
hits of the dual particle cloud of a point in I{e). 

We note that in general there are two cases which must be handled differently, namely, the case 
in which the difference in (|3.873p is based on a Poisson collision event with mean of order 0(-^) as 
in Lemma [3.261 (b) and the other in which there is a law of large numbers effect as in (|3.485p and 
Lemma r3.26f a). However in both cases the expected value used in the calculations below have the 
same form. In the present ease for fixed t we are in the Poisson ease with vanishing rate so that 
at most one Poisson event contributes to the limit. 

Using Proposition l3.24l p.742p we se that the key quantity in (|3.873p satisfies as iV — > 00 that: 

(3.874) E[N{K^4' + K^^^^ - Kf^y^] - N ■ E[q^^^2){W' {■), W"{-), iV, s, t)] 

where 0(i^2)(W^'(')j ^"(O: ■S: i) is given by p.735p and has mean given in p.745p . Hence we 
conclude with p.745p that the limit of the r.h.s. of p.874p is given by 

(3.875) 2n2{E[W]fe°'^'^'+'\ 

Remark 51 Here we have noted that using the coupling introdueed above the number of single 
hits of the k sites in 1(e) coming from either one of the two interacting clouds, can be obtained by 
deducting from the single hits of the "non-interacting Wi and W2 particles" the single hits by Gi^2 
particles in the interacting system. Moreover the event of a double hit, that two occupied sites in 
the dual cloud coincide with one of the k = k{e) sites, can occur from either two Wi sites, two W2 
sites or one Wi and one W2 site. (Since the number of Gi^2 sites or including sites at which both 
Wi and W2 particles coexist are of order 0{jj), the contribution of double hits involving G1.2 sites 
is of higher order and can be ignored.) 

Taking the limit as TV 00 in p.872p . using p.875p . that leads to 
Cov{x^{t + s),x^{t)) 

^^■^^^^ 2K2e''(^'+'^E[{W'{t + s)W"it) E g{t, 2?2(io, «)]• 

1=1 

Recalling p.862p . (|3.757p and the fact that U{t, k) — > U{(X), k) ast ^ 00, we get next as t — )- 00: 



(3.877) 



r.h.s. of (I3TO -2e"(2t+^)£; ksM^ W ELi ^2(to, «)) 



and 



e '(2t+s)Cow(J^(i + s), i^(t)) - 2{E[W])^ 



k 



J2 5(00, 2:2(^0, «)) 

i=l 
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uniformly in s. To verify the uniformity we work with the expression for the expected number of 
Gi,2 sites (in the N ~> oo Hmit dynamics), recah here (|3.753p . to get: 



(3.878) E 



t+s 



W'{u)W"{u - s)e"(""'')e""iyf (t + s, u)e"(*+"-")dw 



Then using that for the TV — > oo hmiting dynamics always W{u) W as u oo and showing 
that the contribution from the early collisions (in [0,s + so(i)]7 so{t) — o{t)) form a negligible 
contribution as in the argument following (|3.757p we obtain 



(3.879) 



t+s 



Wi{u)W2{u - s)e"("-'')e""W^f (i + s, u)e°'^*+''-^Uu 



-{E[W]f 



1^2^9(00, X2ito,i)) 



This proves p.843p and completes the proof of (c). 

(d) follows immediately from (c) and (a). 

(e) To verify this note let X2'"^'^*°'°°\t) denote the contribution to the population resulting 
from rare mutations that occur after to- Then 



(3.880) Eie-"* lim 



N^oo 



(t)] < -e 
a 



-ato 



Since the above results are valid for any to, we can let to — > c», to conclude that they remain true 
if we do not turn off the rare mutation at time to. 

Remark 52 In the above calculations we have included double hits - that is two clouds including 
the same site and that conditioned on a hit the number of particles at that site is given by the size 
distribution and therefore converges to the stable size distribution as t 00. 



Proof of Corollary [mi 

Again we use the dual representation of the moments and analyse the dual based on the enriched 
coloured particle system. We consider separately two cases < /3 < 1/2 and 1/2 < /3 < 1, which 
correspond to the distinction whether two clouds of dual particles descending from the two initial 
particles do not meet as — > 00 in the {(3 /a) log N time scale with a nontrivial probability or 
whether they do next. 

The proof in the case < (3 < 1/2 (where the clouds do not meet asymptotically) follows the 
same lines as for Proposition 13.271 only that now in (|3.829p the limits N —i' 00 and s — )■ 00 are 
carried out at once. Hence the additional step is to verify that the error terms remain negligible 
when s is replaced by sat with 

(3.881) SN ^tN{l3) ~t. 

Now we have to analyse the first and second moments in this time scale. 

To prove the claims we first note that for /3 < 1 the probability that the particle system hits a 
given site still goes to as — > 00. Therefore with p.860p we still can apply the CMJ-theorem 
on exponential growth which takes care of the first moment term asymptotic. 

Moreover, the remainder term in the calculation of the second order term s, i.e. of 

(3.882) £;[e-"*xf(t,ji)e-"*~('^)<(tjv(/?),j2)] 
has the order 

(3.883) 0{—^)=o{N-'). 
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This gives the claim. 

The additional consideration in case ^ < /3 < 1 is that there is a positive probability that two 
clusters will collide in this time scale. However the question is whether the collisions with their 
effects can change the occupation integral appearing in the dual representation in the exponential. 
We see that since the number of those sites at which collisions occur among the total of the N 
sites is o{N) we do not need to consider higher level occupation. 

3.4 Relation between *W and W* 

Let us briefly review the two limiting regimes we have considered. In the first regime wc have 
considered the limiting droplet process 2"^ which is a A^a([0, l])-valucd Markov process. We have 
also proved that there exists a* > such that 

(3.884) {e-"**nr([0,l])}t>o is tight 

and determined the limiting first moment and variance of e~" *II™([0, 1]) as < — od. This has been 
obtained by considering the process 3^ '™ for times in [0, tjy] with — >■ oo, tj^ — ~> — oo and 
letting iV — > oo. Moreover by (|3.844[) it follows that ll™([0, 1]) is Cauchy as t — > cx) and therefore 
has a limit W* in as i oo. 

On the other hand there exists a > such that the empirical process dx) in time 

window { '°^^ +t}t^M converges to Ct, and the limit *yV = limt_>_oo e~"* xCt{dx) exists and is 
random. 

Next recall that we have established that the two exponential growth rates a and a* in the two 
time regimes ^at t oo, In — o(log N) as — > oo respectively = ^ log 7V + tas7V— >-oo,i— > — oo 
are the same, so that in that respect limits interchange. 

It is then natural to conjecture that also the distributions of *yV and W* are the same even 
though there is the following obstacle. Define 

(3.885) tNiP.t) = -\ogN + t, with < /3 < 1. 

a 

Consider the law £[a;^ (ijv(/3,i))] and note that for fixed t there is a discontinuity at /3 = 1 since 
then collisions of the dual occur at order 0(1) and we know that as function of /3 we jump from 
type-2 mass zero for /3 < 1 to a positive value at /3 = 1. Hence the matter to relate *W and W* is 
very subtle. 

In order to investigate the relation between the laws of *yV and W* we next consider the 
relations between the dual process at times corresponding to the droplet formation and the dual 
in the time scale corresponding to the macroscopic emergence. In this direction we verify first that 
the first two moments of W* and *>V agree and then use the techniques of proof to conclude with 
L2-arguments that the two random variables have asymptotically as — > oo, i — > +oo respectively 

— > oo, i — > — oo L2-distance zero. 

Recall the notation tAr(/3, i) = /3q;^^ logiV+t , ijv = ct^^ log A^, which we shall use in the sequel. 

Proposition 3.29 (Relation between *yV and yV* ) 

(a) For Q<I5<1, 

(3.886) lim lim E 

(b) We have the relation: 

(3.887) C[*W] = C[W*]. □ 



logA^ 



e-"*"(^'0)j^(tjv(/3, 0)) - e-"(^+*)£^(^^ + t)) 



= 0. 
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Remark 53 We can summarize the results in US. 886]) as follows. Consider for < /3 < 1, with 
^2 (^) "1^7 quantities: 

(3.888) {^-(t) : < t < {.-(i^ + : i^^^li^ < t < 0}. 

a a a 

Then we have shown so far that these two processes converge in law to 

(3.889) {x2{t) ■.0<t< oo}, {x2{t) : -oo < t < 0}, respectively. 

We can, by the classical embedding theorem of weakly convergent distributions on a Polish 
space, construct both sequences and their limits on one probability space (57, A, P) preserving the 
law of each random variable in question and such that we have a.s. convergence in the supnorm 
on compact time intervals. On this probability space we consider L^{il,A,P). 

Then we have: 

(3.890) W* =L^- lim e-"*X2(t) = _ lim e-"*X2(i) =* W. 

Corollary 3.30 (Equality of moments for growth constant) 

(3.891) E[{*W)''] = E[{W*)''], fc = l,2,.... □ 
Proof of Corollary 13.301 This follows from Proposition 13.291 

Remark 54 We can calculate the moments of*yV and W* by explicit calculation. However since 
we have not proved that all moments are finite we cannot use this to prove the equality in law 

(Em). 

Proof of Proposition 13.291 

We start with the rough idea. We have calculated yar[yV*] in (|3.835p which involved the time 
scales smaller than a~^logiV. An expression for yar[*yV] was derived in p.617p . We see that 
Var[W*] = Var[*yV]. We shall now prove p.886p which will give (|3.887p . The point is that the 
moment calculations now have to include time scales up to a^^ log and must therefore different 
from before treat collisions of the dual cloud for a given point occuring with positive probability. 

We proceed in five steps. We again start by considering the case without mutation but start 
instead with positive but finite initial type-2 mass. We calculate in Step 1 first moments, in Step 2 
second moments at the given time, in Step 3 we calculate the mixed moment between the two times 
specified in the assertion under our assumption. In Step 4 of our arguments we include mutation. 
In Step 5 we conclude the proof. 

Step 1 We begin with the first moment calculation. Consider the calculation of the expected 
value using the dual particle system starting with one particle at j and recall 1(e) from (j3.849|) . 
We will assume that at time wc have an initial distribution as in Proposition 13.271 and that 
furthermore m = 0. As above there will then be a Bm{K^^^^ t)^^^ distributed number of hits of 
the fc-set 1(e) by the cloud of dual particles. To apply the Poisson approximation recall that by 
([3J38l) - ((37iT|) we have: 

(3.892) iff'^'^) = Wa{t)e''' - Q{Wa{-),N,t) + £a{N,t), 
where the error term satisfies 

J2at 

(3.893) E[£a{N,t)] < const—- • supE[Wg(u)]. 
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Therefore the number of hits of the fc-set converges as iV — > oo weakly to a Poisson distribution 
with parameter We"'* — K2W^e^"* + ©(e'^"*). Therefore from (|3.867|) we get looking at 0,1 or 2 hits 
of T(e) and denoting by 

(3.894) {e,, £=l,...,i^,^(i^,)} 

the states of the sites occupied by the dual at time t]y{l,t) and by , the number of occupied 
sites and the size distribution for the finite N dual (nonlinear) system studied above: 



(3.895) 



E[x'^{tM{l,t),j)]^E 



E 



K 



n {i~x^{to,ce)) 



r (t«(i,t),4f) 



(1 )'^t^(i,t) 




n' 


-^J:T.u''itNil,t),e){l-x^{to,^)Y 

1=1 1=1 




^t«(l,t)(-^t«(l,t) ~ 1) 


2iV2 



E U''{tN{l.t),ll)U'^{tN{l.t),£2){l-X^{t0.ll)YKl-xUt^,l2)Y- 



il;i2 = i ii ,^2 = 1 
r-2\ 



We rewrite this in the form 
(3.896) E[x^{tNil,t),j)] 



E 



1 - (W/Ar(tAr)e"* + 0(W^2 (^^)g2at)) J- 5'^(tw(l, i), (to, l)) 



WN{tN)e 



2at 



k k 



E E a;»i)5'^(iiv(l, t), 



ii=l 12 = 1 



+ o(A^-2), 



with 



(3.897) g^{tN{l,t),x) = 1 - ^(1 - x)^f/^(t^(l,t),f). 



Therefore as — >■ oo, we have (recall that WN{tN) W a.s. as — >• oo and M^7v(ijv) are 
uniformly square integrable): 



(3.898) E[x^{tN{l,t),j)]=E 



(VFe"* + OiW'e''^')) E g'^itNiht), x^{to, *)) 



-o{N- 



Next we need to know more about {tN(l, t), a;^ {to, i)) as iV — > oo and then t — > — oo. Recall 
that g^{tN{l,t),x) = l-Efci(l-2;)^;7^(t,£) ( see 15^51) . limN^ooU'^ {tNil,t),-) = U{t,-) (see 
[Xn]) and limt^_oo C/(t, ■) = U{oo, ■, ■) [sce^MM- Therefore 

OO 

(3.899) lim lim 5^(tAr(l, i), J^(to, i)) = 5(00, i2(to, «)) = 1 - V(l - J2(io, £). 
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This allows to calculate next asTV— >-oo,i— >— oo: 

k 



(3.900) (tAr(l,t))] = £; 

Then 



W^iVe"*^5^(t^(l,t),xf(to,*)) 



1=1 



+ o(e"*iV-i). 



(3.901) hm lim £;[e-"*"(i'*)j^(tAr(l,t))] = 

t— > — OO Af— >CXD 



Finally putting everything together results in 

k 

(3.902) lim lim £;[e-"*"(i'*)(x^(tA,(l,t))] = S[iy]y 5(oo,J(io,i))- 



Step 2 We now consider the second moments. Consider the time points iAr(/3i, 0) and fiv(/32, i)j /^i < 
(32- The case /3 < 2 follows as in Proposition 13 . 271 The case (3 = 1 requires more careful analysis, 
we carry out now. Let < /3 < 1, then 



(3.903) 



(tw(/3,0),ji)-xf (tjv(l,t),j2)] 



Sjv(i.*) C(*iv(l,t)) 

j=l (=1 



E 



K. 



(I )^t„(i,t) _|_ 



N,jl,j2 

tjv(l,t) 



k oo 



-fl Y^t„{i.t) 



^5:^C/f(^)(l-J^(to,*))^ 



i=i e=i 



k oo 

5] 5] C/^(t,4)C/~(t,4)(l-5;f(to,»i))'^(l-5?2~(to,*2))' 

ii, 12 = 1^1/2=1 
+o(iV2)^ 

Proceeding as in the proof of Proposition 13 . 271 in the case where ji 7^ j2 we get: 
(3.904) 

C0V(X2~ (tAT (/?, 0) , J 1 ) , (tAT ( 1 , t ) , j2 ) ) 



1 



V^tiv(l. 



is: 



w,i2 



if 



N,jl,j2 



t) ' -"■tjv(/3,0) -"-tNCl.O^tjvC/J^O) 



^g^(t^(.),5?2~(to,*)) 



i=l 



+o{N-^). 



Step 3 The next point is to prove the following claim on second moments for iV — > 00 and then 
as t -00. For < /? < /32 < 1, 



(3.905) lim lim £; [fe-"*"('3'0)xf (tjv(/3, 0)) • e~"*"(''='*)xf (ijv(/32, i))) 

t^ — ooN^oo l\ 



const. 



with a constant independent of /3,/32. 

In order to verify p.905p we must now include new considerations that arise since e"*"(^'*) = 
Ne"* leading to a positive particle density and hence collisions. Recall our representation in terms 
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of the coloured particle system and a careful analysis of the number of Wi, W2 and finally Gi_2 
particles. We begin with the Wi and W2 particles. 

The key point is that in the analysis of the K2 term one needs to include slower growth of the 
Wi and W2 particles due to the nonlinear term. In particular the populations of white particles are 
asymptotically as iV — ?> 00 of the form NF{t) and the function F{t) can be expanded as t — > —00 in 
terms of fc-th powers (fc = 1, 2, • • •) of e"*. Hence the number of particles in the two white clouds, 
the one starting in ji and the one starting in j2 are each asymptotically as iV — ^ 00 behaving as 

(3.906) W^e"*"('^'*) - ^K2VF2g2at„(7,t) _ WNe°'* - K2W^Ne^°'* = Ne"'*W{l - KzM^e"*), 

where we take two independent realisations of W for each cloud and with 7=1-/3. The system 
with start of a particle at ji at time and j2 at time iAr(/3, 0) evolves similar but now an additional 
correction is needed because of the collisions between the two clouds. This effect is represented by 
the G'i_2 particles. 

Hence the important term for the covariance calculation is the one involving the Gi^2 particles. 
The number of G1.2 particles is taking again — > 00 and then afterwards t — > —00 of order 

(3.907) 0{W'W"K2e'^°'*). 

The self-intersections of the Wi and the 6*1^2 particles produce only a higher order correction to 
the number and size distribution of the Gi^2 particles. As a result in the limit as t — > —00 the 
correction terms are all of higher order. 

From together with p.904[ I3.906P and p.907p we can verify that the covariance term is asymp- 
totically the same as (|3.843p . that is, 



(3.908) lim lim cov(e-"(*"(^'*»xf (tAr(l, i)), e-^^^^/^'O^xf (t7v(/3, 0))) 

t— > — 00 N^oo 

= 2iE[W])^ |Aj2 5^g(cx3,2?2(io,«))| ■ 

and we get the claim in p.905p . 

Step 4 We are now ready to conclude the argument for the two assertions of the proposition. 
First we have to show the initial masses we used in the calculation above can in fact arise as 
configuration at time tg. This follows from (|1.76|) . where we show convergence of the J™'^ to 
(recall (|1.59p '). Second we have to prove that the case we treated can be replaced by the masses 
which arise from mutation. For any fixed these masses satisfy p.837p . p.838p and the above 
argument gives therefore the result for mutation turned of after time tQ. We now have to argue 
that this is not relevant for large to. 

The idea is that the contribution to W* due to mutation after to is negligible for large to, more 
precisely, the expected contribution is 0(e^"*") as tp co. The result then follows by letting 
to ^ 00 in Corollarv 13.311 below. 

Step 5 Next we have to go through the list of claims. 

(a) This follows from p.908p above, p.846p together with p.843p and p.905p for /3 = ^2 = 1 
since we see that both the variances of the two random variables and their covariance all converge 
to the same value. 

(b) Follows from (a) by using the construction of Remark[53] (W*,* W) both on one probability 
space) and noting that with tjv(0,t) = t 

(3.909) e-"*"(i^*)S^(tjv(l, t)) = e^'x^ {t^il, t)), 

(3.910) W* = lim lim e~"*"("^*)Sf (tjv(0, t)) 
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and 

(3.911) *yV= lim lim e~*x^ (tN{l,t)). 

This completes the proof of Proposition l3.29l if we could show that in (|3.91ip we can replace tAr(0, t) 
and t oo by t^r with tjv — > oo but ~ o(logiV). Hence the proof of Proposition 13.291 fohows 
from the next corollary. 

Corollary 3.31 Consider t^r f oo with = o(log A^). Then: 

(3.912) lim lim £;[(e-"*J^(t) - e-"*"x^(ijv))^] = □ 

t— >oo Af— >oo 

Proof of Corollary [33T] 

If we consider the case in which we exclude mutation but insert initially some finite mass (even 
for — > oo) then the result follows due to p.846p . What we want is however the statement where 
initially we have no type-2 mass but where we have mutation from type 1 to type 2 at rate mN~^ . 
The idea now is to use the fact that only early mutation counts, that is, mutation after some late 
time has a negligible effect. We can use this idea together with coupling to prove the result with 
mutation. 

We consider solutions of our basic equation for mutation rate m and mutation rate 0. We 
denote the solutions by 

(3.913) (Xf ''"),>o and {X^%>o. 

We can define these two processes on one probability space so that the following relation holds: 

(3.914) °(t) < x^'^\t), 
with 

t 

(3.915) E[x^''^(t) - x^'°(t)] < Const ■ m J e°"'ds. 



We now apply this to compare the process with mutation turned off after time to with the original 
process with mutation running on [0,oo). We know a;^'''(to) — 2?^'™(to) — 0(e"*")) and with 
p.914|) and (|3.915l) we have 

(3.916) < £;[e-"(*°+*)Sf "(to +t)- e-"(*+*°)Sf °(to + t)] < Const ■ me""*". 

Letting to ^ oo we get that we can approximate the systems with mutations by ones where 
mutation is turned off after to uniformly in N. Then the claim follows by using the result for 0(1) 
initial mass of type 2 and no mutation. 

3.5 Third moments 

We expect that the random variables >V*(*>V) have moments of all orders which determine the 
distribution. We do not verify this here but in this section we verify that the third moment of the 
rare mutant type is finite given if we start with total mass of type 1 only using the same method 
as that used for the first and second moments which for convenience we review here. 
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Lemma 3.32 (First, second and third moments ofW*) 

Consider times i^v — > oo with tj^ — o(log A^). The first, second and third moment of the reseated 
total type two mass satisfies: 

(3.917) lim e-°''''E[{x^{tM))\=m*E\W], 

(3.918) lim e-2"*"^[(J^(fw))2] = {m*Y{E[W]f + 2m,* K2{E[W]f 

and for some < K3 < 00, m* = -1(1 + (as in \3.604^ ), K2 given by Jg. 788}^ 

(3.919) lim e-^"'''E[{x^{tN)f]^im*f{E[W]f + 6{m*fK2iE[W]f + 6m*K3{E[W]f. □ 

TV— )-oo 

Proof of Lemma [3732] 

The proof will be based on duality and as a prerequisite we need (in order to determine the 
moment asymptotics) to evaluate the expression which represents the moments in terms of the 
dual particle system. We therefore carry out the proof in two steps, first an expansion of dual 
expression and then the moment calculation. 

Step 1 (Expansion of (t) up to terms of order 0{^)) 

In this subsection we extend the coloured particle system in order to derive and expansion of 
nf^'" including terms of order and which are needed in the calculation of the second and 
third moments of x^ (t). 

Lemma 3.33 (Expansion of 11^^°') 

Consider the dual particle system starting with one, two or three particles. To obtain a unified 
expression we state the result using Wa- 

The corresponding CM J random growth constants (recall ^3.143^ Wa — Wi with one initial 

particle, Wa = Wi + W2 if two particles start at different sites, Wa = W^"^^ if two particles start at 

(2) 

the same site, Wa = Wi + W2 + W3 is three particles start at different sites and Wa = Wf + W2 
if two particles start at one site and one particle at a different site. 
Then there exists K3 > such that 

(3.920) nf/^ 



Then we get for the time-integral and the exponential of it: 

(3.921) H^it) := ^ I nf ds ^ ^(W^a)e' 




(3.922) e-^"« = 1 - H^{t) + '-{H^ {t)f '-{H^ {t)f + 0{{H^{t)Y). □ 

Proof of Lemma 13.331 We use an extension of the multicolour system defined so far in Sub- 
subsection [2231 (Step 1) therein and in the above Subsection 13.41 in order to identify up to 

terms of order e'^"*" with an error term of order 0{^j^). To obtain this we must introduce new 
colours. 



-(a + 7) 



iWa)e 



atM 



N ^ N 



3^3 



oC- 



ATS 



m K2 



o{- 
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Recall in the expansion to order e^"* we worked with the system of white, black, red, green, 
purple and blue coloured particle system and that the dual system is given by the white, red and 
purple particles. The green particles are produced by coalescence of a red and white particle and 
a green site is produced by the migration of a green particle. These represent particles which 
are lost due to one collision (followed by coalescence). The white particles grow as Wa^^^e"* 
and the green particles are produced at rate W^e^"" at time s. Since the green families grow 
with exponential rate a the total number of green particles grows like W^K2e^"*. The number of 
purple-blue particles is of order 0(e'^"*) and the the lower bound is given by the white and red 
particles obtained by subtracting the number of green particles. The upper bound is obtained by 
adding the blue particles. 

In order to identify the next order term we must include the purple particles (and exclude 
the blue particles) and follow what happens when the purple particles coalesce or migrate. This 
requires to introduce two further colours, namely pink and yellow. When a purple particle migrates 
and hits a white, red or purple site we now produce a pink-yellow pair. There are 0(e^"*) pink 
(yellow) particles where now the pink particles provide the dual and the yellow particles now behave 
as the blue particles did before in obtaining the error term. 

We note that the purple particles can coalesce with a white (or red) particle and when this 
happens a white (or red) and green (to distinguish these we label them G2) pair of particles 
is produced at the site where coalescence occurs. Green (G2) particles are produced at rate 
const - W^e^"*. 

The two particle systems we have to compare, the dual one and the collision-free one, are now 
given as follows. 

• The dual is given by the white, red, purple and pink particles and 

• the collision free by white, red, green (G U G2), purple and yellow. 
The modified rules are now: 

• If a purple migrates and hits a site occupied by white or red it dies and creates a pink- yellow 
pair which have subsequently coupled times of birth, migration, 

• the yellow particle is placed in the first unoccupied site in the first copy of N, 

• if a purple particles coalesces with a white or red particle it produces a green G2 particle 

• the pink particles behaves like a standard dual particle. 

This will allow us to approximate the dual with an error of order as: N 00: 
(3.923) 0{N-^e^"*"). 

Now we have to represent the quantity 11^'°' in terms of the numbers of particles of the various 
colours. Here we follow the procedure explained in Subsubsection 13.3.51 Part 2 therein. Then we 
obtain the claimed expansion of p.920|) . Recall that K2VF^e^"* denotes the term corresponding 
to the growth of the green G particles in subsubsection 13.3.51 In the same way the K^W^e^"* 
in p.920p term corresponds to the set of G2 particles, that is, the particles lost due to particles 
involved in exactly two collisions. 

The assertions p.921|) and (|3.922p follow from (|3.920p by explicitly calculating the integral and 
then inserting the expansion in the Taylor expansion of the exponential around 0. q.e.d. 

Step 2 (Moment calculation) 
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First recall the dual identity 

(3.924) E[{x^ {i,t))''] = E 

where is given by p.92ip and where Wa is the CMJ random growth constant (recall p.l43p l 
obtained by starting the dual with k particles at site i. We obtain a similar expression for 
E[x^{i,t)x^{j,t)], where now we start one particle at i and one particle at j, etc. 

We now compute the first three moments following the same method. Note that 

(3.925) E[{x^ (t))] ^E[N-xi (t)] ^ N- NE[xi (t)] 

= N-NE[il-m*^e'^' + 0{^))]. 
Then replacing t by tpf we get 

(3.926) hm e-"*"i?[(xf (i^,))] = lim e-*« Ar[!:^!:|!!le"*« + 0(-l)] 

and (|1.19p follows provided that tpf ~ o{\ogN). 
We now consider the second and third moments; 



(3.927) E[{xnm = 

NE[{x^it,m+N{N-l)E[{xn^,t))x^{j,t)], i^j 
and for i, j, fc g {1, • • • , N} distinct 

(3.928) E[{xUm = NElix'^ii, t)f] + iN{N - l)E[{x?{i, t)fx'^{j, t)] 

+N{N ^1){N- 2)E[x^{z, t)xf (j, t)xf (fc, t)]. 

We first illustrate the method of calculation by calculating in detail the second moment. We 
substitute p. 9211 and p.922p in (|3.933p keeping track of only those terms that do not go to as 
N ^ oo. We get: 



(3.929) £;p^(t))2]=i?[7V-Sf(t)]2 

= - 2NE[x^{t)] + E[x^{t)f] 

= -2N^E[x'^{t)]+NE[{x'^{t)f]+N{N ~l)E[x'^{i,t)x'^{j,t)]. 

We now insert for the appearing moments of x^ the formula (j3.924p and then use the expansion 
of the exponential from p.922p . We first compute the terms coming from the 1 — [t) in the 
expansion of the exponential. This gives 

„3at ^ 



(3.930) 



- 2N^E 



N 



N3 



+NE 



N 



iV2 



,3at 



iV3 



+N{N - l)£;[l -'^{Wi+ W2)e°'* + ^^^(W^i + M^2)^e^"*] 



N^{l-2 + l) + NE 



E -2m*K2W'^e^ 



{2m*We"' + 1 - 1 - m*(M^i + 1^2)6"*) + O 

■ ^(2)gat _^ TO*K2(M^1 + VF2)2e2"* + O i — j 



^3at \ 1 



N3 



= 2m*K2{E[W\fe^°'^ + m*E[Wi + W2Y 



O 



N 
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We next note that the \{H^ {t)Y gives 
(3.931) [m*f{E[W]fe^"\ 

Here Wi , W2 are independent random variable coming from the CMJ hmit starting with particles 
at two disjoint sites and W^^-* is the random variable coming from the CMJ limit starting with two 
particles at the same site. 

Combining all these terms we obtain 



(3.932) E[{x^{t)f] 

= 2{E[W]fmK2e''^' + 2m*E[W]e''' - m* E[W'^'''^\e'^' + {m*f{E[W]fe'' 




Letting t = tff, multiplying by both sides e and taking the limit N ^ 00, we obtain p.918p 

provided that = o(logiV). 

We now follow the same method for the third moment but omit writing out the numerous 
intermediate terms which cancel and get: 

(3.933) E[{x^{m 

= N^- 3N^E[x^{t)] + 3NE[{x^{t))^] - (i))^] 

= N^- 3N^E[{x^{t))] + 3N^E[{x^{t)f] + 3N\N - l)E[x^ {i,t)x^ {j,t)] 
-NE[{xff] - 3N{N - l)E[{x^{i))^xi{j)] - (A^^ _ 3^2 ^ 2N)E[x^ {i)x^ {j)x^ (k)]. 

The proof proceeds by substituting p.92ip and p.922p via p.924p in p.933p . One can verify by 
direct algebraic calculations that the resulting coefhcients of , N'^ , are zero. The 0(1) terms 
involves powers of e"*, e^"* and e^"*. To prove the Lemma it suffices to identify the coefficient of 
e^"*. The coefficient involving (m*)^ comes from the term • {H]y{t))^ in the expansion of the 
exponential. The K2 term comes from the term ^ • {Hj^it)'^ and the K3 term comes from the term 
N^HN{t) in the expansion of the exponential. 

To illustrate this structure we consider the contribution coming from the term 

(3.934) —{H^f = -{m*f^2e'''\Waf. 

We must apply this to each term in the following expression 

(3.935) 7V3[-3S(.Tf (t)) + 3E[x^{i, t)x^it, t)] - E[x^{t, <)xf (j, t)x^ik, t)] 
The corresponding terms with a (Wa)^ result in 

(3.936) - 3E[Wl]3E[{Wi + W^a)^] - E[{Wi +W2 + W^f] = -6E[Wi]E[W2]E[W3]. 
Note that the coefficients of e"*, e^"* depend on higher moments of W. This leads to 



E[ix^it)f] = m*K3{E[W]fe'^"' + 6im*)^ K2{E[W])'^e'^"' + 6(771* f{E[W])^e'^ 
+const • e^"* + const ■ e"* + O 



N 

Letting t ^ t^, multiplying by e~^"*" and taking the limit N ^ 00 we obtain p.919p provided 
that tN = o(log7V). 
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3.6 Propagation of chaos: Proof of Proposition 11.151 

Here we prove Proposition II. 151 We proceed in three steps. 

Step 1 Fix to > — oo and tagged sites 1, . . . ,i. We can again prove for Tjv = ^logN that 
(1, T]y + to)i ■ • ■ 1 ^2 (-^' "^N + ^o)} converges in distribution as iV — >■ oo using the dual represen- 
tation to show that all joint moments converge. Here of course it is equivalent to do the same for 
Xi{i,Tpf + to) which is more convenient for calculation. In particular wc argue as below p.595p 
using the dual process, except that now we start fci-particles at site i, i = 1, . . . , L to calculate the 
mixed higher moments of x^{l, T/v + ^o): ■ ■ ■ : Tjv + to), i.e. 

L 

(3.937) E[l[{x^{j,TN+to)p], fci, fca, • • • , € N, 
in terms of 

Tjv+t 



(3.938) E[expi-^ J n^C^--^-)^^)] 



using formula (|3.716p and (|3.717p and denoting with the superscript (fci, • • • , k^) the initial position. 

Then we obtain convergence as — >■ oo from Proposition 13.81 part (e). (Note the fact that 
the ki here are possibly different does not change the argument). Therefore we have proved the 
convergence of the marginal distribution at time to of the tagged size-i sample. 

Step 2 Next we verify that the limiting dynamics for t > to is as specified in (|1.106p . To do this 
wc first review some results proved earlier. Using the results from Step 1 and Skorohod representa- 
tion of weakly converging laws on a Polish space we can assume that S^^'"(to, 2) — > Ctg (2), a.s.. 

Note that conditioned on £t„(2), the path {Ct{2)}t>to is deterministic. 
Consider for i S {1, . . . , i} the system: 



(3.939) dx2 {i,t) = c{x^ (t) - x^ {i,t))dt - s x^ {i,t){l ~ x^ {i,t))dt 

+ ^d-x^{i,t){l - X^{i,t))dW2{i,t). 
Conditioned on Ct„(2), we know already for N ^ oo (on our Skorohod probability space): 

(3.940) x^{t) ^ J xCt{2,dx), a.s., for t > to. 
Since we know from Step 1 that 

(3.941) C[{x^{i, to), t = l,2,---,L}]=^ C[{xTii, to), t = l,...,L}], 
then a standard coupling argument yields 

(3.942) C[{x^ii, t)}.=i....,L. t>to] =^ C[{x^{i, t),^i L; t>to}], 

JV->oo 

where xf{-, ■) satisfies for i = 1, . . . L, t > to. 

(3.943) dx^{i,t)= c{J xCtit,dx)~x^ii,t))dt-sx^{i,t){l~x-^{i,t))dt 

+ ^d-xf{i,t){l - xf{i,t))dw2{i,t). 
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Step 3 Now we need to show that we have for the equation p.943|) a unique sohition for i G M. 

We can prove that, given {J xCt{2,dx))t£R, which converges to as t — — oo, (|3.943|) has a 
unique solution on (— oo,io] as follows. 

We can construct a minimal solution by considering a sequence (in the parameter n) of solutions 
corresponding to starting the process at time i„. Then consider the tagged site SDE driven by 
this mean curve starting at at time i„ — >■ —oo. The sequence of solutions forms a stochastically 
monotone increasing sequence as n — > oo (use coupling). Then observe that any solution x(t) that 
satisfies x{t) ^ a.s t ^ —oo must have zeros (cf. classical result on Wright-Fisher diffusions). 
Then by coupling at a zero we get that it must agree with the minimal solution. This gives 
uniqueness for a given mean path (£t(2))tgR. Note that this also implies the uniqueness of (£t)tgR 
since we have already established the uniqueness of the limiting mean curve. 

This completes the proof. 

3.7 Extensions: non-critical migration, selection and mutation rates 

The hierarchical mean field analysis allows us to investigate the emergence times and behaviour for 
a wide range of scenarios involving different parameter ranges for mutation rates, migration rates 
and relative fitness of the different levels. Our main focus in this work is the critical case in which 
mutation, migration and fitness all play a comparable role and for this reason we have chosen the 
parameterization /3i = 0, ^2 = 0, /Ja = 1. However the basic tools and analysis we develop here 
can be adapted to other scenarios. Although we will not carry out the analysis in detail we now 
briefly indicate the main features of the different cases. 

For example, consider the following "general" parametrization of the migration, selection and 
mutation rates: 

(3.945) s^-^, /32>0, 

(3.946) mAT = /33 > 0. 

The different ranges of the migration parameters have the following interpretation. The value 
/?! = corresponds to euclidian space with dimension 2. The values < /3i < 1 correspond 
to dimensions d > 2. We do not consider this case in this paper but for a detailed discussion of 
the relation of random walks on the hierarchical group to random walks on Z*^ and for further 
references see [DGWOl] . 

We have discussed the case /3i = = and = 1 above and we will now briefly discuss the 
case (1) where /3i = but (32 > and (2) the cases f^i — (^2 ~ but either < 1 or (3^ > 1, which 
are the other most interesting cases. 

(1) In order to understand the behaviour (3i — 0, 132 > consider the behaviour of the dual 
process. Then the dual process has the property that most occupied sites have only one factor 

St 

(until the number of factors is 0{N)) and the number of factors grows like ew^. In speeded- up 
time scale N^^t the factors move quickly with possibility of coalescence each time a pair collides. 
If /^s < 1, then heuristically, emergence will occur when 

(3.947) e^=0(iV^^) 
that is. 



(3.948) t = 0{Nf^'^ log N). 
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This means that emergence occurs much later than in the critical regime considered in our work. 

(2) We now restrict our attention to the case Pi = (32 — and discuss the effect of different 
mutation rates. The case of faster mutation, < /^s < 1, has the effect that emergence can occur 
before collisions become non-negligible. In particular, if the mutation rate is with < 1 we 
get emergence at time /S^alogN and the emergence is deterministic, i.e. Var(a;^ (a/^a log A'^)) — )■ 0. 
In this case if we look at the dual process there are 0{N^~^^) occupied sites at time 0(1) and 
therefore we have this number of droplets of size N^^ at time af3^ \ogN. Therefore we have a law 
of large numbers and deterministic emergence dynamics. 

On the other hand, slower mutation, /Ja > 1, means that the "wave of advance" of the dual 
system must move to distance 2 or higher depending on the value of /Ss > 1 in order to produce 
the number of dual particles needed to guarantee emergence. 

4 Appendix. Nonlinear semigroup perturbations 

We use the following result of Marsden |MA] . (4.17). 

Theorem 1 (Perturbation) 

Let M be a Banach space and let As be the infinitesimal generator of a strongly continuous 
semigroup, with \\St\\ < Me*^* for some C. Let At : B — >■ B be a vector field on B such that At 
is of class C'^ with its first and second derivatives uniformly bounded on bounded subsets and let 
{Tt} be the flow of At- 

Then As + At has a unique flow which is Lipschitz for each t, < t < T , and 

(4.1) Vtx = lim {St/,, ■ r,/„)"x 

n^cxD 

uniformly in t for each x on hounded sets of t. If x ^ ^(^s + ^t), then 

(4.2) jVtx = {As + AT)Ut 

on [0, r) where r is the exit time from B. □ 
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